Gene Pden_4190 details


Gene Information

Locus tag: Pden_4190
Symbol:
ID: 4582740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Paracoccus denitrificans PD1222
Kingdom: Bacteria
Replicon accession: NC_008687
Strand:
Start bp: 1345444
End bp: 1346739
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID: 639771497
Product: extracellular solute-binding protein
Protein accession: YP_917950
Protein GI: 119386895
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
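
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon.

```python
# Illustrative consistency check; function names are hypothetical and not
# part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1345444, 1346739)
print(length_bp, protein_length(length_bp))  # prints "1296 431"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.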


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.224886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAGAA CCCTATTCTG GCTTGCGGGG ATGATGGCGC TGCCCGTCGC GGCCGAGGCC 
AAGACCGAAA TCTCGTGGTG GCACGCGATG ACCGGCGCCA ATGCCGAGGT GGTGACCAAG
ATCGCCGGCG ACTTCAACGC GAGCCAATCG GATTACGAGG TGAAGCCGGT CTTCAAGGGC
ACCTATCCCG AGACGCTGAA CGCCGGCATC GCCGCCTTCC GCGCTGGGCA GGCGCCCGAC
ATCATTCAGG TCTTCGACGT GGGCACGGGC GTGATGATGG GCGCGCAGGG CGCGATCAAG
CCGGTGGCCG AGGTGCTGCA AGAGGGCGGC TATGCCTTCG ACAAGGGGCA GTATCTGCCC
GGCATCGTCG GCTATTACTC GACGCCCGAG GGCGACATGC TGTCCTTCCC CTACAACTCC
TCCTCGCCGA TCCTGTATTA CAACAAGGAC ATCTTCGAAA AGGCGGGGCT GGATGTCGAG
AACCCGCCCA AGACCTGGGC CGAGGTCTGG GCCGCCGCGC GCCAGATCAA GGAATCGGGG
GCCGCGACCT GCGGCTATAC CTCGACCTGG CTGACCTGGA TCCATACCGA GAACTTCGCC
GCCTGGAACG ACGTCAGCTG GGGCACGAAC GAGAACGGCC TGGCCGGCCC GCCCGAACTG
AAGATCGACG GCCCGCTGTT CGTCAAGCAC TTCCAGGAGC TTGCGGATCT GGCGAAAGAG
GGCGTCTTCG TCTATGGCGG CCGCACCAGC GAGGCCAAGC AGAACTTCAC CTCGGGCGAA
TGCGGCATCC TGACCGAAAG CTCGGGCGGG CTTGGCGACA TCGTCAAGTC GGGCATGAAC
TACGGCATCG GCCAGCTGCC CTATGACGAG ACGGCCGAGG GTGCGCCGCA GAACACCACG
CCGGGCGGCG CCTCGCTCTG GGTGATGGGC GGCAAGTCGG AAGAGACCTA CAAGGGCGTC
GCCGCCTTCT TCAACTACCT GTCGCAGACC GACGTGCAGC AATACCTGCA CGAGCAGTCG
GGCTATCTGC CGGTGACCAT GGCGGCTTAC GAAGTCACGA AGGGTTCGGA CTTCTACCAG
CAGAACCCGG GCCGCGAGAC GCCGATCCTG CAGATGATGG GCAAAGAGCC GACCGAGAAC
TCCAAGGGCG TGCGCGCGCC GAACCTGCCG CAGCTGCGCG ACATCCAGAA CGAGGAATAC
GAAAAGATGC TGGCCGGCCA GCAGGACGCG GCCACGGCGC TGAAGAACGC CGTCGAGCGC
GGCAATGCCG CGATCAGGGA AGCCGTGGGC GGCTGA
 
Protein sequence
MQRTLFWLAG MMALPVAAEA KTEISWWHAM TGANAEVVTK IAGDFNASQS DYEVKPVFKG 
TYPETLNAGI AAFRAGQAPD IIQVFDVGTG VMMGAQGAIK PVAEVLQEGG YAFDKGQYLP
GIVGYYSTPE GDMLSFPYNS SSPILYYNKD IFEKAGLDVE NPPKTWAEVW AAARQIKESG
AATCGYTSTW LTWIHTENFA AWNDVSWGTN ENGLAGPPEL KIDGPLFVKH FQELADLAKE
GVFVYGGRTS EAKQNFTSGE CGILTESSGG LGDIVKSGMN YGIGQLPYDE TAEGAPQNTT
PGGASLWVMG GKSEETYKGV AAFFNYLSQT DVQQYLHEQS GYLPVTMAAY EVTKGSDFYQ
QNPGRETPIL QMMGKEPTEN SKGVRAPNLP QLRDIQNEEY EKMLAGQQDA ATALKNAVER
GNAAIREAVG G
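
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted; since this gene starts with ATG, a plain standard-code lookup is assumed to be sufficient here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAAAGAA CCCTATTCTG GCTTGCGGGG"))  # prints "MQRTLFWLAG"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above, and `gc_fraction` can be used to compare against the reported GC content.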