Gene TM1040_0447 details


Gene Information

Locus tag: TM1040_0447
Symbol:
ID: 4076090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 465297
End bp: 466481
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 62%
IMG OID: 638005743
Product: extracellular ligand-binding receptor
Protein accession: YP_612442
Protein GI: 99080288
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
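
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 465297
end_bp = 466481

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported gene length of 1185 bp and protein length of 394 aa.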


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.766827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTTA TGTTTGCTGT TTTCAATCGG GCCCGCAAGA TTGCTTTGAC CGCCACCGCA 
GTTGTTGCCT CTGCATTTGT CGCCGCCTGT GACCCCGGAG CATTGTCCGG CGGGGGGCCG
ACCATCAACA CCTCCAAACC TGTGCCTGTC GCACTGCTGG TACCGCGCGG TTCGGCACAG
CATGGTGATG GCGTTCTGGC CCAGAGCCTT GAAAATGCAG CCCGCCTCGC GATTGCGGAT
CTGAACGGCG TAGAGGTGGA CCTGCGTGTC TATGACACAG CGGGCAACCC TGAAACCGCC
GCTGCCGTTG CCTCGCAAGC GGTGCAGGAC GGCGCGCGCA TCATTCTTGG TCCGGTCTAT
GCCGAAGCCG CAAATGCCGC AGGGATTGCC GCCGCAAAGC GCGGTGTGAA CGTGCTGGCC
TTCTCCAACA ATGCCTCGAT CGCGGGCGGC AACGTGTTCG TTCTGGGCTC GACCTTTGAG
AACTCCGCCA ACCGCTTGAC CCAATATGCC AAACGCCAGG GCAAGAACTC CATGGTGGTT
GTGTCGGGCA ATAATGCCGC CGGACAGGCC GGGCGTTCTG CCATTCAGCA GGCCGCCGTG
CAGAGTGGCA TGACCATTAC GGGCAACGTC AGCTATGAGC TGTCGCAGCA GGGTGTGATC
AACGCGATCC CGACCATTAG CCAGAATGTG CGTCAGAACA AAGCGGACGT GATGTTCATG
ACCGCGACCA CCGCAGGCGC GCTGCCGCTG TTGTCGCAGC TGTTGCCCGA AGCCGGTGTC
ACGCCAGAAG ACGTGCAGTA CATGGGCCTG ACCCGTTGGG ACATCCCCGC GCAGACGCTT
GAACTGCCCG GAGTTCAGAA CGGCTGGTTC GCCCTGCCCG ACCCACAGAA GTCTGCCTCT
TTCCGTGCAC GTTATCAATC CGCATATGGC GCGGCACCGC ACCCGATCGG TGGTCTGGCC
TATGACGGGA TCGCCGCCAT TGGCGCGCTG GTCAGCTCTG GCAACTCCGG GGCGCTCACC
GGTGCGGCTC TGACACAACC CGCAGGTTTC CAGGGCACAG GGGGTATTTT CCGCCTGCGC
CCGGATGGCA CCAGTGAACG TGGTCTCGCC ATCGCAACGA TCCAGGACAA GAAAGTCGTC
ATCATTGACC CAGCGCCACG AAGCTTCCCC GGAGCCGGTT CCTGA
 
Protein sequence
MRFMFAVFNR ARKIALTATA VVASAFVAAC DPGALSGGGP TINTSKPVPV ALLVPRGSAQ 
HGDGVLAQSL ENAARLAIAD LNGVEVDLRV YDTAGNPETA AAVASQAVQD GARIILGPVY
AEAANAAGIA AAKRGVNVLA FSNNASIAGG NVFVLGSTFE NSANRLTQYA KRQGKNSMVV
VSGNNAAGQA GRSAIQQAAV QSGMTITGNV SYELSQQGVI NAIPTISQNV RQNKADVMFM
TATTAGALPL LSQLLPEAGV TPEDVQYMGL TRWDIPAQTL ELPGVQNGWF ALPDPQKSAS
FRARYQSAYG AAPHPIGGLA YDGIAAIGAL VSSGNSGALT GAALTQPAGF QGTGGIFRLR
PDGTSERGLA IATIQDKKVV IIDPAPRSFP GAGS
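
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch in Python of how the gene sequence might be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons.

```python
# Compact standard codon table: codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCTTTA TGTTTGCTGT TTTCAATCGG GCCCGCAAGA TTGCTTTGAC CGCCACCGCA"
print(translate(first_line))
print(gc_content(first_line))
```

Translating the first line of the gene sequence yields MRFMFAVFNRARKIALTATA, which matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions should allow the reported protein sequence and GC content to be checked in the same way.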