Gene EcolC_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0388 
Symbol 
ID6066786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp436601 
End bp438565 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content50% 
IMG OID641599787 
Productgeneral secretion pathway protein D 
Protein accessionYP_001723393 
Protein GI170018439 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02517] general secretion pathway protein D 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0226374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTGCG TCATGAAAGG ACTCAATAAA ATCACCTGCT GCTTGCTGGC AGCACTACTC 
ATGCCTTGTG CAGGACACGC TGAGAACGAA CAATACGGCG CGAACTTCAA TAACGCCGAT
ATCCGCCAGT TCGTGGAAAT AGTGGGTCAG CATCTTGGCA AAACGATCCT GATCGACCCT
TCGGTACAGG GAACCATTTC CGTACGCAGT AATGATACGT TTAGCCAACA GGAGTACTAC
CAGTTCTTTT TAAGTATTCT TGATCTTTAC GGTTATTCCG TGATCACGCT GGACAATGGT
TTTCTGAGAG TGGTTCGCTC AGCTAATGTA AAAACATCGC CAGGGATGAT TGCTGACAGT
TCTCGTCCAG GCGTAGGTGA TGAGTTGGTC ACCCGAATCG TACCGCTTGA GAACGTTCCT
GCTCGTGACC TGGCCCCCCT GCTCCGCCAG ATGATGGATG CGGGTAGCGT CGGTAATGTT
GTGCATTATG AACCCTCCAA CGTTCTTATT CTGACCGGTC GTGCCTCCAC CATTAATAAA
CTGATTGAAG TCATAAAGCG CGTTGATGTC ATCGGCACAG AGAAGCAGCA AATTATTCAT
CTGGAATATG CGTCAGCGGA AGATCTCGCC GAGATTCTTA ATCAATTAAT CAGCGAAAGC
CACGGTAAAA GCCAGATGCC AGCCCTCCTC TCCGCGAAGA TTGTGGCGGA TAAGCGAACC
AACTCTCTTA TCATCAGTGG ACCGGAAAAA GCACGCCAGC GCATCACTTC ATTACTGAAA
AGCCTTGATG TCGAAGAGAG CGAGGAAGGA AATACCCGGG TTTATTACCT GAAATATGCT
AAAGCCACGA ATCTGGTGGA AGTGCTAACC GGTGTTTCCG AAAAGCTGAA AGATGAAAAA
GGGAATGCGC GTAAGCCCTC CTCTTCTGGC GCGATGGATA ACGTCGCCAT TACCGCCGAT
GAACAGACTA ACTCTCTGGT CATTACCGCT GACCAGTCCG TCCAGGAAAA ACTCGCCACG
GTAATTGCGC GTCTGGACAT TCGCCGTGCA CAGGTGCTGG TTGAGGCAAT CATCGTTGAA
GTTCAGGATG GAAATGGACT AAACCTCGGC GTGCAATGGG CGAATAAAAA CGTTGGCGCA
CAGCAATTTA CCAATACCGG ATTACCGATT TTTAACGCTG CGCAAGGTGT GGCTGATTAT
AAAAAGAATG GTGGGATCAC CAGCGCGAAT CCTGCCTGGG ATATGTTTAG CGCCTACAAT
GGCATGGCCG CAGGCTTCTT CAATGGCGAC TGGGGAGTAC TGCTTACCGC GCTGGCCAGT
AACAATAAAA ATGACATCCT CGCCACCCCA AGCATCGTAA CGCTGGATAA TAAACTCGCG
TCCTTCAACG TGGGGCAGGA TGTGCCGGTG CTATCCGGGT CACAGACCAC TTCAGGGGAT
AACGTCTTTA ATACCGTCGA ACGCAAAACG GTGGGGACAA AACTCAAAGT TACTCCGCAG
GTCAATGAAG GCGACGCGGT GTTGCTCGAA ATAGAGCAGG AAGTCTCCAG CGTTGACTCT
TCCTCTAACT CGACGCTCGG CCCGACGTTT AATACCCGTA CTATTCAAAA CGCCGTGCTG
GTCAAAACCG GTGAAACGGT GGTCCTGGGC GGATTGCTGG ATGATTTTTC TAAAGAGCAA
GTGTCAAAGG TTCCTCTGCT TGGCGATATT CCTTTAGTGG GGCAACTCTT CCGCTATACC
TCCACCGAGC GCGCTAAACG CAACCTGATG GTATTTATCC GTCCGACGAT TATCCGTGAC
GATGATGTTT ATCGCTCACT GTCAAAAGAG AAATACACCC GTTACCGTCA GGAGCAACAA
CAGCGGATCG ACGGGAAATC AAAAGCGCTG GTTGGCTCGG AAGATTTGCC GGTGCTGGAT
GAAAACACGT TCAACAGTCA CGCCCCTGCG CCATCGTCAC GGTGA
 
Protein sequence
MDCVMKGLNK ITCCLLAALL MPCAGHAENE QYGANFNNAD IRQFVEIVGQ HLGKTILIDP 
SVQGTISVRS NDTFSQQEYY QFFLSILDLY GYSVITLDNG FLRVVRSANV KTSPGMIADS
SRPGVGDELV TRIVPLENVP ARDLAPLLRQ MMDAGSVGNV VHYEPSNVLI LTGRASTINK
LIEVIKRVDV IGTEKQQIIH LEYASAEDLA EILNQLISES HGKSQMPALL SAKIVADKRT
NSLIISGPEK ARQRITSLLK SLDVEESEEG NTRVYYLKYA KATNLVEVLT GVSEKLKDEK
GNARKPSSSG AMDNVAITAD EQTNSLVITA DQSVQEKLAT VIARLDIRRA QVLVEAIIVE
VQDGNGLNLG VQWANKNVGA QQFTNTGLPI FNAAQGVADY KKNGGITSAN PAWDMFSAYN
GMAAGFFNGD WGVLLTALAS NNKNDILATP SIVTLDNKLA SFNVGQDVPV LSGSQTTSGD
NVFNTVERKT VGTKLKVTPQ VNEGDAVLLE IEQEVSSVDS SSNSTLGPTF NTRTIQNAVL
VKTGETVVLG GLLDDFSKEQ VSKVPLLGDI PLVGQLFRYT STERAKRNLM VFIRPTIIRD
DDVYRSLSKE KYTRYRQEQQ QRIDGKSKAL VGSEDLPVLD ENTFNSHAPA PSSR