Gene TM1040_1821 details

Gene Information

Locus tag: TM1040_1821
Symbol:
ID: 4076967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1915660
End bp: 1916616
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 58%
IMG OID: 638007136
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_613816
Protein GI: 99081662
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
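
The Start bp, End bp, Gene Length and Protein Length values in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1915660, 1916616)
print(length)                  # 957
print(protein_length(length))  # 318
```

Both results agree with the Gene Length (957 bp) and Protein Length (318 aa) fields listed in the record.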


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.054126
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.742084
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATCCGG TGATCAAACT ATTGGCTCAG CGCATTGCGC TGAGTCTCTT CCTGCTGTTG 
ATGATCTCGG CGCTGATCTT TGCGGGCACC ATGATCCTGC CCGGCGACGT CGCCCAGTCC
ATCCTCGGCC AATCCGCAAC CCCCGAGGCG CTCGCCAACC TGCGCGCCGA GCTTGGACTG
AACGAACCCG CCCTCACGCG CTATTTCGAC TGGCTTTTTG GTGCCTTGCA AGGCGATCTT
GGCACCGCTC TGACCTCTGG TCAGGACATC ACTTCCGCCA TTGGATCGCG CCTCTCGAAC
ACGCTGTTCC TTGCGTTCTG GGCCGCTGTG ATCGCCGTGC CGCTGGCGAT TTTCCTCGGT
CTTCTGGCCG TGCGCTACAA AGATCGCTGG CCGGACAAGC TGATTTCCGG CGTCACATTG
GCGTCGATCT CCATTCCTGA ATTCCTGATC GGTTACGTTC TGATGTATCT GATCGCCGTA
AAACTGCGCT GGTTTCCCTC TGTCGCCATG ATCAACGACG GCATGAACCT GTGGCAGAAG
CTCAATTCCA TCGCCCTGCC CGTCGCGGTG CTGACGCTTG TGGTGCTTGC CCACATGATG
CGCATGACCC GTGCGGCGAT CCTCAACGTG ATGCAGTCCG CCTATATCGA GACTGCGGAA
CTCAAGGGGC TTTCGACCTT CAAGGTGATC TGGCGTCATG CCTTCCCCAA CTCGATCGCG
CCGATCGTGA ATGTGGTGAT GCTGAACCTC GCCTATCTTG TGGTTGGTGT CGTGGTGATC
GAAGTGGTCT TCGTCTATCC CGGCATGGGG CAATATCTGG TGGATCATGT CTCCAAACGT
GACGTGCCGG TGGTGCAGGC CTGTGGTCTG ATCTTTGCCA CCGTCTATAT CGGCCTCAAC
ATGGTTGCCG ATATCGTGTC GATCCTGTCG AACCCGCGTC TGAGGCATCC GAAATGA
 
Protein sequence
MNPVIKLLAQ RIALSLFLLL MISALIFAGT MILPGDVAQS ILGQSATPEA LANLRAELGL 
NEPALTRYFD WLFGALQGDL GTALTSGQDI TSAIGSRLSN TLFLAFWAAV IAVPLAIFLG
LLAVRYKDRW PDKLISGVTL ASISIPEFLI GYVLMYLIAV KLRWFPSVAM INDGMNLWQK
LNSIALPVAV LTLVVLAHMM RMTRAAILNV MQSAYIETAE LKGLSTFKVI WRHAFPNSIA
PIVNVVMLNL AYLVVGVVVI EVVFVYPGMG QYLVDHVSKR DVPVVQACGL IFATVYIGLN
MVADIVSILS NPRLRHPK
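
The protein sequence above is the translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation without external libraries. It assumes the standard amino-acid assignments for each codon (which table 11 shares with the standard code) and does not handle alternative start codons; this gene begins with ATG, so that case does not arise here. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATCCGG TGATCAAACT ATTGGCTCAG"))  # MNPVIKLLAQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would be expected to give the full 318-residue protein.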