Gene TM1040_3328 details


Gene Information

Locus tag: TM1040_3328
Symbol:
ID: 4075733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 337576
End bp: 339660
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 62%
IMG OID: 638004836
Product: hypothetical protein
Protein accession: YP_611562
Protein GI: 99078304
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
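
The coordinate and length fields above are mutually consistent, and the short sketch below shows one way to check that. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 337576
end_bp = 339660

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover_bases, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (2085 bp) and Protein Length (694 aa).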


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.901171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.708038
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCCA AAGAGCCTGC ACAGCTAAAG ATTTCAGATC AACGCCCTTC AATGTGGCGC 
AACCTCTCAC TGGTCTGGCT CGTGCCCCTG CTGGCCCTGA TCGTATCGCT TGGCATCGCC
TGGAAGAGCT ATTCAGAGCG CGGCGAGTTA ATCCACATCG CTTTTGACAA TGGCTCTGGG
GTGGTCGCCA ATGAAACCAC GCTGCGCTAT CGCGATGTGG TGATTGGCCG GGTGGAGCAG
GTCGGCTTCT CCGAGGACCT GAGCCGTGTT TCAGTTGCCG TTCGCGTCGA CACGGACATC
GCTCCCTTTC TGGATGAAGA TGCGACCTTC TGGGTGGTGC GCCCGGAGGT GAGTTCGCGC
GGGATCTCGG GGCTGAGCAC GGTGCTGTCC GGCGTCTATA TCGAAGGCGC CTGGGACGAG
ACCGCCGGAT CCGCGCAGAC CGAATTTGTG GGGCGGGATC GCCCCCCTCT GGTGCAGCCC
GGACGCGCCG GGCGCCGGAT TACCTTGCGG ACCGAAGATG GCAGGATGAT CACCGAGGGC
GCGCCTGTCC TGTTTCGCGG CATCGAAGTG GGCCGCCTTG AACACCCCCG CCTGACCGTA
AGCGGCAACA GCATCGTGGT GGATGCCTTT ATCGAGGCAC CGCACGATCG CCGGATCAAC
TCGGCCACGC GCTTTTGGGA TACCTCCGGC TTTACCGTCT CCTTTGGAGC GGAAGGGCTG
TCTTTGGATG TCGACAGCCT TGCATCCCTT GTTTCCGGCG GGATCGAATT CGACTCTGTC
TTTGATGGCG GCAGCCCGGC GAACCCCGGC GCGGTCTTTG ACATCTATCC AGATGAGGCC
ACCGCAAGGC GCACTGCTTT TGCGCGTTCC ATGGCCGGTG GCGTTGCGAT CTCTGTAGCG
TTTGACGACT CTGTTGCGGG TCTGAGCAGT GGTGCACCCG TAGAGCTCGG CGGTATCAAG
GTAGGCGAAG TCGGCAGCCT CACCGCCGCG ATCCAGAATG AGACCGACAC GGCAGATGTC
AAACTGGTGG CCAAACTCCT GCTGCAACCC GGATTGCTGG GTCTTGCGCC AGGTGCCAAC
GAAGAGGACG TTTTGGACTT TCTTGAGACC GCTGTTGAGG GTGGAATGCG GGCGCGTCTC
GCTTCTGCCG GACTCCTGAG TTCAGAGCTG ATGGTAGAAC TGGTGCGCCT CGATGATGTC
CCTCCCGCCA GATTTGATCG CACAGCAGAG CCCTTCCCGG AAATGCCAAG CGCGCCATCG
GATCTGCCTG ATTTCTCTGC AACAGCAGAA GGCGCGATGG AACGGATCAG CCAGCTTCCG
GTGGAGGAAT TGATGGCGCA GGCGATCAAT ACCTTGGCCA GCATCGAGGC TCTGGCCGCC
GCCGAAAGCA CCCGGCAGGC CCCGGCAGCT GCTGTGGCTC TCTTGGAAGA AACCCGCGCC
CTGCTGAACG ATCCAAGCAC ACGCGCCCTG CCCGGAGAGC TGCGCGCCAC CGTCTCAGAT
CTGCGTGCAA TCCTGCAGGA GTTGCAACAG GGACGCGCCG TTGCGAACCT GACCGCTGCG
CTTCAAGATG GGGCGCGGGC CGCAGATGAG ATCGCGACAG CCTCAGAGAA TCTGCCCGCG
TTGGTGGACG ATCTGCGCGA CCTTGCCGCC AAAGCCAACA ACCTTGAAGC GGAAGAGCTG
ATCCAATCGG TCAACACGCT GATGACCAGT GCAGATGCGC TGATTGGCAC AGAGGCGGCT
CGTGCGCTTC CAAGCTCATT GTCGGCGGCG CTCGAGGAAA TCCGCACCAC TCTCGCGACA
CTGCGCGAGG GCGGATTAGT CGACAACGCA AATGCCACCA TGGTCTCGGC CCGCGATGCC
ACGCAATCGG TTGCGCAAGC CGCAGAAGGC CTGCCCGCGC TCTCCGCGCG GCTGGAGCGC
CTCATGACCC GGTCAGAATC GCTGGTGGCG GCCTATGGGG ATCGCTCCAA TTTCAATATG
GAAACGCTCG ATATGCTCCG CGAAATCAAA TCCGCAGCGC GCGCGGTCTC GCAGCTTGCC
CGCAAGATCG AACGGGATCC GAACTCACTG GTATTTGGTA GATGA
 
Protein sequence
MNPKEPAQLK ISDQRPSMWR NLSLVWLVPL LALIVSLGIA WKSYSERGEL IHIAFDNGSG 
VVANETTLRY RDVVIGRVEQ VGFSEDLSRV SVAVRVDTDI APFLDEDATF WVVRPEVSSR
GISGLSTVLS GVYIEGAWDE TAGSAQTEFV GRDRPPLVQP GRAGRRITLR TEDGRMITEG
APVLFRGIEV GRLEHPRLTV SGNSIVVDAF IEAPHDRRIN SATRFWDTSG FTVSFGAEGL
SLDVDSLASL VSGGIEFDSV FDGGSPANPG AVFDIYPDEA TARRTAFARS MAGGVAISVA
FDDSVAGLSS GAPVELGGIK VGEVGSLTAA IQNETDTADV KLVAKLLLQP GLLGLAPGAN
EEDVLDFLET AVEGGMRARL ASAGLLSSEL MVELVRLDDV PPARFDRTAE PFPEMPSAPS
DLPDFSATAE GAMERISQLP VEELMAQAIN TLASIEALAA AESTRQAPAA AVALLEETRA
LLNDPSTRAL PGELRATVSD LRAILQELQQ GRAVANLTAA LQDGARAADE IATASENLPA
LVDDLRDLAA KANNLEAEEL IQSVNTLMTS ADALIGTEAA RALPSSLSAA LEEIRTTLAT
LREGGLVDNA NATMVSARDA TQSVAQAAEG LPALSARLER LMTRSESLVA AYGDRSNFNM
ETLDMLREIK SAARAVSQLA RKIERDPNSL VFGR