Gene M446_1339 details

Gene Information

Locus tag: M446_1339
Symbol:
ID: 6135470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1472995
End bp: 1474233
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 69%
IMG OID: 641641618
Product: ABC transporter substrate binding protein
Protein accession: YP_001768289
Protein GI: 170739634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
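
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1472995
end_bp = 1474233

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1239
print(protein_length_aa)  # 412
```

Both values agree with the Gene Length and Protein Length fields.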


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.358784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.213428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCATCA GGAGATTGCT GCGCGCGCTC GGCGCGTGCG CGGCCGCCGT CGCGGCCGGC 
CCGGCGCTCG CGCAGCCGGG GCCCTTGCCG GAGCCCTTGG CGGTTCGGAT CGGGGTGCTG
ACGGATCTCT CGAGTCTCTA CGCGGACACC AACGGCAGTG GCGCCGTGAC GGCGGCCCGC
ATGGCGGTCG AGGATTACGA GGCGGCGGGC GGCACGGTGA GGGCCGAGGT CGTGTCCGCC
GACCACCAGA ACAAGGCCGA TATCGGCTCG ACGATCACGC GGGAATGGTT CGACCGGGGC
GGCGTGGACG TCGTCGTGGA CGTGCCGAAC TCGGCCGTGG CCCTCAGCGT GACGGAGCTG
GCGCGTCAGA AGAACAAGGT CTTCATGAAT TCGGGTGCGT CCTCGTCCGA TTTCACGGGC
AAGAACTGCA CCCCCAACAG CATTCACTGG ACCTACGACA CCTACGCGCT CGCGACCGGC
ACCAGCCGGG CCGTGGTGGC CTCGGGCGGG GATTCCTGGT ACCTGCTGAC GGCGGATTAC
GCGTTCGGCC ACACGATGGA GGCCGACATC CGCAAGATCC TCTCGGCCTC GGGCGCCACG
GTGGTGGGCA GCGCCCGCAC GCCCCTCAAC ACCCCCGACT TCTCGTCCTT CCTGCTGCAG
GCGCAGGCCT CGAAGGCCAA GGTGGTCGGG CTGGTCAACG CGGGCGGCGA CACGATCAAC
GCGATCAAGC AGGCGTCCGA GTTCGGCATC GTGCAGGGCG GGCAGAAGAT CGCCGCCATG
GTGCTCTACA TCACGGACGT GCACTCCCTC GGCCTCGGCG TCGCCCAGGG GCTGCAGTTC
ACGGCCGCCT ATTACTGGGA TCTCAACGAC GGCACCCGGG CCTTCGCCAA GCGGTTCTCC
GCGCGCATGG GCGGGCGCAT GCCGACGCAG CTCCAGGCCG GCGCCTACTC GGCGACGCTG
CACTACCTCA AGGCCGTCGA GAAGGCCGGG ACGAAGACCG ACGGCAAGCG GGTCGTCGAG
GTCGCCAAGA GCCTGCCGAC CGATGATCCG GCCTTCGGCA AGGGGACGAT CCGGGCCGAC
GGGCGCAAGA TGCACAACAT GTACCTGTTC GAGACGAAGA CGCCGTCCGA ATCGAAGGGG
CCCTGGGATT ATTACAAGCT GATCAGGACC ATCCCGGCCG AGGAGGCGTT CCGGCCGATG
GGAGAGGGCG ACTGCCCGAT CGTCACCGGC TCGCGCTGA
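
A GC content figure such as the one reported in the Gene Information section is the percentage of G and C bases in a sequence. The helper below is a hedged sketch of that calculation, not the database's own method; `gc_percent` is a hypothetical name, and the input shown is only the first 10-base block of the gene sequence, used as a small worked example.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters so spaces and line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)

# First 10-base block of the gene sequence above: 3 G + 2 C out of 10 bases.
first_block = "ATGGGCATCA"
print(gc_percent(first_block))  # 50.0
```

Passing the full gene sequence, with or without the spaces between blocks, gives the value for the whole gene.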
 
Protein sequence
MGIRRLLRAL GACAAAVAAG PALAQPGPLP EPLAVRIGVL TDLSSLYADT NGSGAVTAAR 
MAVEDYEAAG GTVRAEVVSA DHQNKADIGS TITREWFDRG GVDVVVDVPN SAVALSVTEL
ARQKNKVFMN SGASSSDFTG KNCTPNSIHW TYDTYALATG TSRAVVASGG DSWYLLTADY
AFGHTMEADI RKILSASGAT VVGSARTPLN TPDFSSFLLQ AQASKAKVVG LVNAGGDTIN
AIKQASEFGI VQGGQKIAAM VLYITDVHSL GLGVAQGLQF TAAYYWDLND GTRAFAKRFS
ARMGGRMPTQ LQAGAYSATL HYLKAVEKAG TKTDGKRVVE VAKSLPTDDP AFGKGTIRAD
GRKMHNMYLF ETKTPSESKG PWDYYKLIRT IPAEEAFRPM GEGDCPIVTG SR
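
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares for elongation; the tables differ in permitted start codons), and `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop spaces and line breaks used for display formatting.
    seq = "".join(dna.upper().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks and the final nine bases of the gene sequence.
print(translate("ATGGGCATCA GGAGATTGCT GCGCGCGCTC"))  # MGIRRLLRAL
print(translate("TCGCGCTGA"))                         # SR
```

The two outputs match the first block and the last two residues of the protein sequence listed above.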