Gene Pden_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_1901 
Symbol 
ID4581121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp1900774 
End bp1901832 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content68% 
IMG OID639769221 
ProductABC transporter related 
Protein accessionYP_915692 
Protein GI119384636 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR03415] choline ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.337326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC GCATGCAAGC CGCACATAAT GCGGTCGAAT TCGACAATGT GAACATCGTG 
TTCGGCGACG CGCCCAATGC CGCGCTGCCG CTGATGGACC AAGGGCTGGA ACGCGGCCCG
ATCAAGGCCG AGACCGGACA GGTGTTGGGC GTTCACAATT GCTCGCTGAC CGTGCGCGAG
GGCGAGATCC TGGTGCTGAT GGGCCTGTCC GGCTCGGGCA AGTCCACGCT TCTGCGGGCG
GTGAACGGGC TCAACGCCGT CTGCCGCGGC GAGGTGCGGA TCTGGGACGG CGACCGCATG
GCCTCGGTCA CGAAAGCCTC GGGCGCCGAG TTGCGGCGAC TCAGGCGCGA ATGCATCGCC
ATGGTCTTCC AGCAATTCGG CCTGCTTCCC TGGCGCTCGG TACGCGAGAA CGTGGGGCTG
GGGCTGGAAC TTGCCGGCAT CCCCGCGGCC GAGCGGCGCA AGCGCGTCGA TGCCCAGCTC
GCCACCGTCG GCCTGGCCGA ATGGGCCGAG CGCAAGGTGG GCGACCTTTC GGGCGGCATG
CAGCAGCGCG TCGGCCTGGC ACGCGCATTC GCCACCGAGG CGCCGATCCT GCTGATGGAC
GAGCCCTTCT CGGCGCTGGA CCCGCTGATC CGCAACCGGC TCCAGGACGA GTTGCTGGAA
TTGCAGCAGC GGTTCCGCCG CACCATCATC TTCGTCAGCC ACGACCTCGA CGAGGCGTTC
CGCATCGGCA ACCGCATCGC GCTGATGGAG GGCGGCCGCA TCGTGCAGGT CGGCACCGCG
CGCGAGATCA TCGCCAATCC GGTCAACGCC TATGTCGAGG ATTTCGTGGC CCACATGAAC
CCGCTGGCGG TGCTGACCGC CGAGGACATG GCCGAGCCGG GCGAAGCGGC GGGCGAGCCC
ATCGACCCCG AAACCCCGGT GCGCGAGGTC ATCCAGATCA TGACCGAGGC GGATGCCGAG
ATGCTGCCCC TGCAAGGCGG ACGGCGGGTG ACGCGGCAGG GCATCATGAC GCGCCTGGTC
GCGCAACGCA GCCAGGGCGG CGGTGCCGTC GAGCACTAG
 
Protein sequence
MSERMQAAHN AVEFDNVNIV FGDAPNAALP LMDQGLERGP IKAETGQVLG VHNCSLTVRE 
GEILVLMGLS GSGKSTLLRA VNGLNAVCRG EVRIWDGDRM ASVTKASGAE LRRLRRECIA
MVFQQFGLLP WRSVRENVGL GLELAGIPAA ERRKRVDAQL ATVGLAEWAE RKVGDLSGGM
QQRVGLARAF ATEAPILLMD EPFSALDPLI RNRLQDELLE LQQRFRRTII FVSHDLDEAF
RIGNRIALME GGRIVQVGTA REIIANPVNA YVEDFVAHMN PLAVLTAEDM AEPGEAAGEP
IDPETPVREV IQIMTEADAE MLPLQGGRRV TRQGIMTRLV AQRSQGGGAV EH