Gene TM1040_3418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3418 
Symbol 
ID4075592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp439560 
End bp440591 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content61% 
IMG OID638004927 
ProductApbE-like lipoprotein 
Protein accessionYP_611652 
Protein GI99078394 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.999205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAACC CTCTTCTCCT TTCTCGCCGT AGCTTTATGG TCATGCCTCT TGCCCTGATG 
GCCTGCAAGA AAGGGTGGTC GCTGTTGGAA CTCAGCGGGC TGACGATGGG GACAAGCTAC
TCGATCGTTG CGATCGATCA CAGCAAATCC GTCGAGAAAG CAGAGCTGCA GGCGGCTGTC
GACAAGGCGC TGGCGCAGGT CAACGTGCAG ATGTCCAACT GGGATGCCGC GTCCGAGGTG
TCGCAGTTCA ACGCGCTGGC CGCTGGCGAG AGCCTGTCGG TCTCTGGCGA GCTGCATCAT
GTGATGCAGG CTGCGCAGGA CGTGCATTTT GCAAGCGACG GTGCGTTTGA CGTGACCGTG
GGTGGTCTCA TCGATCTTTG GGGCTTTGGC GCGGGTCAGA CCCGCAGCGA TCTTCCCTCC
GAGGCCGAGA TTGCGGCTGC CATGGGGTGC TGCGGTCAGG CGCAGTCGGT CGAGCTTGAG
GCCGGTGGCC TCAAGAAGCT TAACGCTGGC GCCGAGGTCT ATCTGTCCTC CATCGGAAAA
GGGTTCGGTG TTGATCAGCT GGCCCGAGTC CTCAAGGGCT ATGGCATCAC CGACTACATG
GTCGAGATCG GTGGCGATCT CTACACCGCT GGGCGCAACC CCGATGGTCA GCCCTGGCAG
ATCGGGATTG AGACGCCCGA GGCTTTTGAC CGTGGCGTGA CCCAGGTGGT TGGGCTTTCT
GACATGGGCA TGGCCACCTC TGGCGATTAC CGCAACTTCT TTGACGTCGA TGGCAAGCGC
TACTCGCATA TCATCGACGC CACCACGGGC CGTCCGGTGG AACACGACAC CGCGTCTGTC
ACCGTTCTGA CCGACAACGC GATGCTGGCG GATGCATGGG CGACAGCGAT GCTGGTGCTG
GGTCGCGAGC GCGGTCTCGA GATCGCAAAT CAGCGTGATC TCGCGGTTCT GTTCCTTGAT
CGTGCGGTTG CAAATGGCGA CAATGGGTTT ACCTCTGTCG CATCAAACCG TTTCGAGGCG
CTGACCGCGT AA
 
Protein sequence
MSNPLLLSRR SFMVMPLALM ACKKGWSLLE LSGLTMGTSY SIVAIDHSKS VEKAELQAAV 
DKALAQVNVQ MSNWDAASEV SQFNALAAGE SLSVSGELHH VMQAAQDVHF ASDGAFDVTV
GGLIDLWGFG AGQTRSDLPS EAEIAAAMGC CGQAQSVELE AGGLKKLNAG AEVYLSSIGK
GFGVDQLARV LKGYGITDYM VEIGGDLYTA GRNPDGQPWQ IGIETPEAFD RGVTQVVGLS
DMGMATSGDY RNFFDVDGKR YSHIIDATTG RPVEHDTASV TVLTDNAMLA DAWATAMLVL
GRERGLEIAN QRDLAVLFLD RAVANGDNGF TSVASNRFEA LTA