Gene TM1040_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1984 
Symbol 
ID4077168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2088076 
End bp2089935 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content63% 
IMG OID638007299 
ProductABC transporter related 
Protein accessionYP_613978 
Protein GI99081824 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.434554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGC AATACATGTC ATCGCGCAAA TTGTTGCGCT GGCTCTGGCG CGGCTACCTG 
CGGCACCACG TCGGACTATT GGCGATTGCT GTCTTCTTCA TGCTGCTCGA GGGCGCCTCC
GTGGGCGGGC TCAGCTACAT GATGCAGCCG ATGTTTGACC TCGTGTTTGT GGACGGCAGC
GAAAACGCCC TCATCTGGGT GAGCGTTGCG TTTTTTGCCA TCTTCGTGCT GCGCGGCTTC
AGTTCGGTAA CGCAGCGGGT GATCCTCGCC ACGATCAACC AGCGGTCCGA GGCGCATATG
CGCACGGATA TGCTCGAACG GCTGATCCAT CAGGACGCGA GCTTTCACCA GCATCACCCG
CCCGGCTTCT TGATCCAGCG CGTGCAGACT GACGTCGCGG CGATCAACCA GGTCTGGCAG
GCGATCATCA CCGGCTCGGG ACGCGATGTG GCGCAGCTTG TTGCAGTGCT GACCGTCGCT
ATCAGCGTCG ACTGGCGCTG GACCCTGATC ATGCTGATCG GCGTGCCGCT TCTGGTGTTG
CCGCTCTCGA TCGTGCAGCG GTATGTCCGC AAGAAAGCCA GCCTTGCTCG CGACCTTGGC
GCAGTGCAGG CGACCCGTCT CGACGAGGTC TTCCACGGTA TCGTCCCGAT CAAGCTCAAC
CGGCTTGAGG ACTACCAGAG CGGGCGTTAC CGCTCGGCAA CTGCGGCCTT TGTCCGGGCG
CAGATCAAGG CGGCCCTCGG CGCCTCCTCG ACCACGGGGA TGGTGGATCT GATGGCTGGT
TTCGGGGTCA TGTGCGTGAT CCTCTTTGGC GGGCGCGAAA TCATCGACGG CGACAAGACC
GTCGGCGAAT TCATGAGCTT TTTCACCGCC ATCGGCCTCG CCTTTGACCC GATGCGCCGC
CTCGCCGCTA TCATGGGCAT CTGGCAGGGC GCCGCCGCCG CGATGGAGCG GATCAAGGAG
CTGATGGATG AACCCATCAC GCTCGTGTCC CCGGAACACC CCAAGCCCGC CCCCAAAGGG
CTGCCCGAAA TCCGTCTTGA TAACGTGAAC CTCTATTATG GCGAGGCGCA TATCCTGCGC
GATCTGTCAC TGGTGGCCGA AGCCGGCAAG ACCACCGCGC TGGTGGGCGC CTCCGGTGCC
GGGAAATCCA CCATTTTCAA CATCCTGACC CGGCTTGTTG ATCCCCAGAC CGGCTCTGTG
ACCCTGGATG GCACTGAGGT GCGGGATCTT GATCTGGGCG ATCTCAGGGA TCTCTTTTCC
GTGGTCACCC AGGACGCGCT GCTGTTTGAC GAGACGCTGC GCGAGAACAT CCTTCTGGGG
CGCACGGATG TGAGCGAAGA ACGCCTTGCA GAGGTTCTGG ATGCCGCTCA TGTGTCGGAC
TTTCTGCACA AGTTGCCCGA AGGGCTCGAA ACCCGCGTGG GCCCGCGCGG CTCGGCGCTC
TCGGGGGGGC AGCGTCAACG CGTGGTGATC GCCCGCGCGC TCTTGCGGGA CACGCCGCTC
CTGCTGTTGG ACGAGGCCAC CTCGGCGCTG GATGCTCAAT CCGAGAAAGT GGTGCAAAAG
GCGCTCGAGA AACTCTCTGG TGGGCGCACG ACAATCGTGA TCGCGCACCG TCTTTCGACC
ATCCGCTCGG CGGACAAGAT CGTGGTGATG GAGCGCGGCC GCGTGATGGA TCAGGGCCGC
CACGAGGAGC TGCTGGAGCG CGGCGGGATC TATGCAGATC TCTATCGTTT GCAGTTCCAG
GACGGGAAAA CCGTGATCGA CACCGATGGG ATGAACGCCC AGATCGCGCA GAACGACAAT
CGCAACGCAC GCGAGGAAAC CGGCCTTCTA CGCCGCTTTG CCCGCCGCCT CTTTGGCTGA
 
Protein sequence
MSTQYMSSRK LLRWLWRGYL RHHVGLLAIA VFFMLLEGAS VGGLSYMMQP MFDLVFVDGS 
ENALIWVSVA FFAIFVLRGF SSVTQRVILA TINQRSEAHM RTDMLERLIH QDASFHQHHP
PGFLIQRVQT DVAAINQVWQ AIITGSGRDV AQLVAVLTVA ISVDWRWTLI MLIGVPLLVL
PLSIVQRYVR KKASLARDLG AVQATRLDEV FHGIVPIKLN RLEDYQSGRY RSATAAFVRA
QIKAALGASS TTGMVDLMAG FGVMCVILFG GREIIDGDKT VGEFMSFFTA IGLAFDPMRR
LAAIMGIWQG AAAAMERIKE LMDEPITLVS PEHPKPAPKG LPEIRLDNVN LYYGEAHILR
DLSLVAEAGK TTALVGASGA GKSTIFNILT RLVDPQTGSV TLDGTEVRDL DLGDLRDLFS
VVTQDALLFD ETLRENILLG RTDVSEERLA EVLDAAHVSD FLHKLPEGLE TRVGPRGSAL
SGGQRQRVVI ARALLRDTPL LLLDEATSAL DAQSEKVVQK ALEKLSGGRT TIVIAHRLST
IRSADKIVVM ERGRVMDQGR HEELLERGGI YADLYRLQFQ DGKTVIDTDG MNAQIAQNDN
RNAREETGLL RRFARRLFG