Gene TM1040_3051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3051 
Symbol 
ID4075145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp20200 
End bp21609 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content60% 
IMG OID638004552 
ProductFAD linked oxidase-like 
Protein accessionYP_611287 
Protein GI99078029 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.89467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA CGCCGACTGA CCGCCCCTCC TTCGATTGGG CCAATTTTGC CGCCAGCCTT 
TCGCCGGTCG AGGTGATTGA AGAACCTGTC CTGATCAAGA AACGCTCGCG GGATTTCTTT
TGGTACAGCC CGGTTCTGAA CGCCCAGCTG AAAAAGTGCT TTGGCGATCT GGTGGCGGTG
CCCCACACGA AGGATGAGAT GCGCCATTGT CTGCAGCGCG CCTATGAGGC CGATGTGCCT
GTGACACTGC GTGGCGGCGG CACCGGGAAC TACGGGCAGG CGGTGCCACT GCAAGGAGGG
TTGATCCTCG AAACCACCAA GATGAACCGC ATTCTGGAGA TTGGTGACGG CTATGTGCGC
GCTGAAGCGG GGGCCTTGAT GGCGGATGTC AATGCGGCGC TGATCGCGCA AGGCTGGGAA
ATGGCGATGT TCCCATCGAC ACAGGACATT GCCACCATTG GCGGGTTTGT GGCCGGCGGC
TCTGCCGGGA TCGGGTCGAT CGCAAATGGC GCCCTGCGTG AAAAAGGAAA CATCATGCAG
CTCAAGGCGT TTTCGCTTGA GGCAGAGCCG CAAGAGCATG TCTTTGACGC TGAAGACGCC
CTGCAGCTGC ACCATGCCTG GGGATTAAAC GGGGTCATCA CCGAAGTGAC GCTGCGCACC
GTGCCGCATC GCAACTGGAT TGGGTGCATG GCCACCTTTG ACAGCTATGA AGCGTGCTAT
GCTGCTGGCT ATGCGCTTGC TACCTCAACG CAGATTGGCC GCAAGCTGGC CAGTACGGTC
GAAGCCCGCA TTGTCGCCTA TTTCCCGCGC CTCAAGGATC ATTTGCGTGA AGGCAAACAC
CTGCTGGTGT CACTGGTTCC CGCCGAGGAC ATGGAGGCCC TGCGGGCACT GATAAAGGCG
CAGGGCGGCC ATCTGGATCT GGCGATGAGT GATGCCGAGC GGCAGGCGGC AAAGCTGCCG
CATGTTTTTG AATTCGCCTA CAACCACACC ACGCTGCAGG TCCTGAAGGC CGATCGCGCG
GCCACCTATC AGCAAATCGG GGTCCCGGAT CCTGCGGATG CGCGAGCGGT TGCGGCCGTG
GGTGCGGCCT TGGGCAATGA TGTCTGGCAG CACCATGAGT TCGCGCGGGT GGATGGCAAG
ATTGTCGCCT TTGACCTGCC GATCATCTGG TTCACCGATG AGGCACGGCT GCGCGAGATC
GACAAGACCT ATGAGGCGCA CGGCCACAGC GTCGCCGATG CGCATACCTA TTTCGTCGAG
GGGGGCGGGC TGAAGAATGC CGATTATCGC CACCTGGCGT GGAAGAAACG CATGGACCCC
AAGGGACTGC TGAATCCTGG CAAGTCACGG GCCTGGGAGG AGGTAAAACA CCTCCCCGCT
GAGGAAATCG AAGCAAAGGC AAAGGGCTGA
 
Protein sequence
MTDTPTDRPS FDWANFAASL SPVEVIEEPV LIKKRSRDFF WYSPVLNAQL KKCFGDLVAV 
PHTKDEMRHC LQRAYEADVP VTLRGGGTGN YGQAVPLQGG LILETTKMNR ILEIGDGYVR
AEAGALMADV NAALIAQGWE MAMFPSTQDI ATIGGFVAGG SAGIGSIANG ALREKGNIMQ
LKAFSLEAEP QEHVFDAEDA LQLHHAWGLN GVITEVTLRT VPHRNWIGCM ATFDSYEACY
AAGYALATST QIGRKLASTV EARIVAYFPR LKDHLREGKH LLVSLVPAED MEALRALIKA
QGGHLDLAMS DAERQAAKLP HVFEFAYNHT TLQVLKADRA ATYQQIGVPD PADARAVAAV
GAALGNDVWQ HHEFARVDGK IVAFDLPIIW FTDEARLREI DKTYEAHGHS VADAHTYFVE
GGGLKNADYR HLAWKKRMDP KGLLNPGKSR AWEEVKHLPA EEIEAKAKG