Gene Dtpsy_1063 details

Gene Information

Locus tag: Dtpsy_1063
Symbol:
ID: 7382872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 1105507
End bp: 1106595
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 70%
IMG OID: 643654378
Product: ApbE family lipoprotein
Protein accession: YP_002552540
Protein GI: 222110276
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
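
The coordinate and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both are assumptions here, since the record does not state its conventions. A minimal Python sketch of that check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(1105507, 1106595)
print(length_bp, protein_length(length_bp))  # 1089 362
```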


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCAG ATCGGTGCAT GCCGGCGGTA GGCAACCGGT GGACGCGGCG GCGCTTCGCC 
CTTGCGCTGC CGTTGCTGGG CGCGCTGGCG TACCTCCACC CGCGCCAGGC TTTGGCTGGC
ATGGTGGAGG GCGCGCAGCC CCTGGTGCGC GCGAGCCGTA CCCTCATGGG CACACGCGTG
GACATCGCGG CTGCCGTTGG CAATGGCCGC GATGCCGGCG CTGTGCAGCA GGCGATGCTG
CACGCATTCG CGGAAATGGA GCGCCTGGAG GCGCTCATGA GCCGTTACCG GGAGGGCAGC
GACGTGGCGC GGCTCGGCGC GGCCGCCGGC CGGCATGCCG TGCACGTGGC CCCGGAAGTG
ATGCAGGTGC TGCGCACGGC GCGTCGCCTG CACCAAGAAA GTGCTGGCGC CTTTGATCCC
ACCGTCGGTG CACTGCGGGG CTGGCATTTT GAGCCCGGCC ACGAAGCCGC GCCGGCACCT
GAGGAGATCG CTCAGGCGCT GCGTTTGGTG AACGCGCGCC ACCTCGTGCT GGACGAGCGC
GCAGGAACGG CCTACCTTGC GCGGCCGGGC ATGGGGTTGG ATCTGGGCGG CGTGGCGAAG
CTACCTATTT TGCAGGCGGG CTTGCAAGTG CTGGAGCGCG CTGGCGTCAC GGATGCGCTG
GCCAACGGCG GTGGCGATGT CCTGGTCATG GGCCGGCAGC ACGACCGTCC CTGGCGTGTG
GGCGTGCGCA ATCCCTCCGC TCCGGCGCAG CTGCTGGGCG TACTGGAACT GCAAGGGCGC
GGCGTGGTGG CATCGTCCGG CGACTACGAG CGGGGCTTCC TGCGTGCAGG ACGCCGCCTG
CACCATGTGC TCAACCCCCG CACGGGTTGG CCTACGGAAG GTGTGTCTGG CGTGGCGCTC
ATGGCCGAGC GTGTTGAAGA CGTCAACGGC TGGGGCACGG CGCTGATGGT GCAAGGGGCT
GCGGCCGCAC CTGCATGGCA TGCGGACCAC GCACATGTCG AAGCCCTCGT GGCGAGCGCT
GATGGCACGC CCTGGAGTTC CCCTGGAATG CTCGCCGCGC TGCGGCCAGT GCCGGCGCGC
GCAGGATGA
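
The GC content field can in principle be recomputed from the gene sequence above. The following is a rough sketch, not the database's own method: `gc_content` is a hypothetical helper that removes the spaces and line breaks of the 10-base block layout and returns the fraction of G and C bases. The example call uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only, as a small illustration.
first_line = "ATGAATTCAG ATCGGTGCAT GCCGGCGGTA GGCAACCGGT GGACGCGGCG GCGCTTCGCC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full sequence block as one string gives the whole-gene value, which can then be compared with the GC content field.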
 
Protein sequence
MNSDRCMPAV GNRWTRRRFA LALPLLGALA YLHPRQALAG MVEGAQPLVR ASRTLMGTRV 
DIAAAVGNGR DAGAVQQAML HAFAEMERLE ALMSRYREGS DVARLGAAAG RHAVHVAPEV
MQVLRTARRL HQESAGAFDP TVGALRGWHF EPGHEAAPAP EEIAQALRLV NARHLVLDER
AGTAYLARPG MGLDLGGVAK LPILQAGLQV LERAGVTDAL ANGGGDVLVM GRQHDRPWRV
GVRNPSAPAQ LLGVLELQGR GVVASSGDYE RGFLRAGRRL HHVLNPRTGW PTEGVSGVAL
MAERVEDVNG WGTALMVQGA AAAPAWHADH AHVEALVASA DGTPWSSPGM LAALRPVPAR
AG
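
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the forward reading frame starting at the first base, and it builds the codon table from the standard amino-acid string, which translation table 11 shares for all codon assignments (table 11 differs only in its permitted start codons; this gene starts with ATG, so no special handling is included). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last nine bases of the gene sequence in this record.
print(translate("ATGAATTCAG ATCGGTGCAT GCCGGCGGTA"))  # MNSDRCMPAV
print(translate("GCAGGATGA"))  # AG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.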