Gene TM1040_1532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1532 
Symbol 
ID4075830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1636457 
End bp1637821 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content63% 
IMG OID638006845 
Producthypothetical protein 
Protein accessionYP_613527 
Protein GI99081373 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.340629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGG ATATTTTTGG ACAAGAGTGC AGCCTCACCG ATCCCCGCGC ACTCGCGGAA 
TGGAACGGCG CGCAGACCGG CGTTCTTGCC CATGCGGCGC GCACCGCTGG CCACCTTGGG
GCGGTGTTGG ACGCCGCGCC CGATTTTGCC CTCGGTCAGG CCATCAAGGG GCTGTCGCTG
TTGATGCTTG GGCGCTCCGA ACTCTTGCCG ACCGCGCGCG AGGCGCTGGC TATTGCCAAG
TCGACCTATG AAGGCGCGTT GCCGCGAGAA CGCAAATATG TCGACGCATT GGAAGCCTGG
CTCTCGGGTC ATCCCTCGCG CGCGATCACC TGCATGGAAG ACATCCTGAC CCGGCATCCC
TGCGACACGC TGGCGATGAA ACTCAGCCAT GGGATCCGCT TTATCATGGG CGACGCCCGC
GGCATGCGCG CCTCTATCGA GCGTGTGCTG CCCGCCTATT CGACCGAGCA CGCAGGACAT
GGCTATCTGT TGGGCTGTCA CGCCTTTGCC CTCGAGGAAA CCGGCGACTT CGACCGGGCG
GAAATCACCG GACGTCAGGC ACTTTGGACC GCGCCGGACG ATGCATGGGG GCTGCATGCG
GTGGCACATG TGCATGACAT GACCGGAAAT GCCAGGACTG GATTGGGTTG GCTTGAGGGC
CGCGAAGAGG CCTGGGCGCA TTGCAACAAT TTTCGCTACC ATGTGTGGTG GCACAAGGCC
CTGATGCACC TCGACCTCGG CCAAATCGAC GAGGTGATGC GGCTCTATGA TGATGAGGTG
CGCAAGGACA AGACCGACGA CTACCGCGAC ATCTCCAATG CCACCTCGCT GCTGATGCGG
CTCGAACTTG ATGGGGTCAA TGTCGGAGAC CGCTGGGACG AGCTGGCAGA GCTTTGCGAA
AACCGAACTG AGGACGGCAG CCTCATCTTT GCCGATCTGC ATTACCTGCT GGCGCTGATC
GGCGGCGATC GCGCAACGGC CACAGGCCAG CTGATCCGGC GGATCCATGC CGATGGAACC
CAGCCCAAGA CCGAAGCCGC GCAAAGAATG GCCGACCCGG GCTGCGCGGT GTCAAAGGGA
CTTGAGGCTT TTGGGGAAGG CCACTACGGC ACAGCCTTCG ACTACCTCGC TAAGTCACGG
GATTCGTTGC AACTTGCAGG TGGCAGCCAT GCCCAGCGGG ACGTGTTTGA ACGCATGACC
ATCGACGCTG GGCTGCGCTC GGGGAACTGG GCACAAGTGG AGGCCATTTT GGATGACAGA
CGTGCCAAAC GCGGAGGGGG CGAAGACAAT TATGCGATGG CCCGTCGCGC CTTGATTGCG
GCCGCCCAAA GCGAGGGCGG CGCACAGAGC GTCCCGGCGG AGTGA
 
Protein sequence
MTQDIFGQEC SLTDPRALAE WNGAQTGVLA HAARTAGHLG AVLDAAPDFA LGQAIKGLSL 
LMLGRSELLP TAREALAIAK STYEGALPRE RKYVDALEAW LSGHPSRAIT CMEDILTRHP
CDTLAMKLSH GIRFIMGDAR GMRASIERVL PAYSTEHAGH GYLLGCHAFA LEETGDFDRA
EITGRQALWT APDDAWGLHA VAHVHDMTGN ARTGLGWLEG REEAWAHCNN FRYHVWWHKA
LMHLDLGQID EVMRLYDDEV RKDKTDDYRD ISNATSLLMR LELDGVNVGD RWDELAELCE
NRTEDGSLIF ADLHYLLALI GGDRATATGQ LIRRIHADGT QPKTEAAQRM ADPGCAVSKG
LEAFGEGHYG TAFDYLAKSR DSLQLAGGSH AQRDVFERMT IDAGLRSGNW AQVEAILDDR
RAKRGGGEDN YAMARRALIA AAQSEGGAQS VPAE