Gene Pden_1899 details

Gene Information

Locus tag: Pden_1899
Symbol:
ID: 4581119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Paracoccus denitrificans PD1222
Kingdom: Bacteria
Replicon accession: NC_008686
Strand:
Start bp: 1898812
End bp: 1899765
Gene Length: 954 bp
Protein Length: 317 aa
Translation table: 11
GC content: 66%
IMG OID: 639769219
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_915690
Protein GI: 119384634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
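
The reported lengths follow from the coordinates above. The short Python sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Coordinates copied from the record above.
start_bp = 1898812
end_bp = 1899765

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the Gene Length (954 bp) and Protein Length (317 aa) fields.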


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.526364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTC GTCGCACCAC CGCCGCTCTT GCCGCCGGGC TGGTCGCCTC GCTCGCCACG 
GCTGGCATCG CTGCGGCAGA AGAGCCGATG GAGTGCCGCA AGGTCGTCTT CTCGGATGTC
GGCTGGAGCG ATATCAGCGC CACCACGGCG CTGGCCTCGA CCGTGCTCCA GGCCCTCGGC
TACCAGACCG AGACCAAGAT CCTGTCGGTG CCGGTGACCT ATACCGCCAT GTCGACCGAC
GACGTGGACG TGTTCCTGGG CAACTGGATG CCGACGATGG AGGCCGATAT CGCCCCCTAT
CGCGAGGCGG GCACGGTCGA GATCGTGCGC ACGAACCTGA CCGGGGCGAA ATACACGCTG
GCGACGAACC AGGCCGGCGC CGATCTGGGC ATCGACGATT TCGGCAAGAT CGCCCAGCAC
AAGGACGCGC TGGCCGGCAA GATCTACGGC ATCGAGCCGG GCAATGACGG CAACCGCCTG
CTGCTGGACA TGGTGGCCGA CAACAAGTTC GACCTAGGCA CCTTCGAGGT CGTCGAAAGC
AGCGAACAGG GCATGCTGGC GCAGGTCGCC CGTGCCGACG CCGCCGGCAA GCCGGTGATC
TTCCTGGGCT GGGAGCCGCA TCCGATGAAC AGCCAGTTCC AGATGACCTA CCTGTCCGGC
GGCGACGAGG TCTTCGGCCC CGACTTCGGC GGCGCGCGGG TCGATACCAA CACCCGCGCC
GGCTATGTCG AGGCCTGCCC GAACGTCGGC AAGTTCCTGC AGAACCTGGA ATTCACCCTG
CCCATGGAGA ACGAGGTCAT GGGCCTGATC CTGAACGACG GCGAGCAGCC CGCCGATGCG
GCGCTGAAAT GGCTGAAGGC CAACCCGGAC GCGGCAAAAC CCTGGATCGC GGGCGTGACC
GCTGCCGATG GCGGCGATGC GCAGGCGGCC CTGGACACGG TGCTGTCCAA GTGA
 
Protein sequence
MTFRRTTAAL AAGLVASLAT AGIAAAEEPM ECRKVVFSDV GWSDISATTA LASTVLQALG 
YQTETKILSV PVTYTAMSTD DVDVFLGNWM PTMEADIAPY REAGTVEIVR TNLTGAKYTL
ATNQAGADLG IDDFGKIAQH KDALAGKIYG IEPGNDGNRL LLDMVADNKF DLGTFEVVES
SEQGMLAQVA RADAAGKPVI FLGWEPHPMN SQFQMTYLSG GDEVFGPDFG GARVDTNTRA
GYVEACPNVG KFLQNLEFTL PMENEVMGLI LNDGEQPADA ALKWLKANPD AAKPWIAGVT
AADGGDAQAA LDTVLSK
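
The protein sequence can be reproduced from the gene sequence by removing the block spacing and translating codon by codon. The Python sketch below is one way to do that, not the method used to produce this record. The names `clean`, `translate`, and `CODON_TABLE` are illustrative. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs from the standard code only in its permitted start codons, which this sketch does not model.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses these same assignments; alternative start
# codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGACTTTTC GTCGCACCAC CGCCGCTCTT"))
print(translate("CTGTCCAA GTGA"))
```

The first call covers the opening ten codons of the gene and the second covers its final four, including the TGA stop codon.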