Gene EcHS_A3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3519 
SymbolgspD1 
ID5593286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3500708 
End bp3502672 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content50% 
IMG OID640922636 
Productgeneral secretion pathway protein D 
Protein accessionYP_001460117 
Protein GI157162799 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02517] general secretion pathway protein D 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0147755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTGCG TCATGAAAGG ACTCAATAAA ATCACCTGCT GCTTGCTGGC AGCACTACTC 
ATGCCTTGTG CAGGACACGC TGAGAACGAA CAATACGGCG CGAACTTCAA TAACGCCGAT
ATCCGCCAGT TCGTGGAAAT AGTGGGTCAG CATCTTGGCA AAACGATCCT GATCGACCCT
TCGGTACAGG GAACCATTTC CGTACGCAGT AATGATACGT TTAGCCAACA GGAGTACTAC
CAGTTCTTTT TAAGTATTCT TGATCTTTAC GGTTATTCCG TGATCACGCT GGACAATGGT
TTTCTGAGAG TGGTTCGCTC AGCTAATGTA AAAACATCGC CAGGGATGAT TGCTGACAGT
TCTCGTCCAG GCGTAGGTGA TGAGTTGGTC ACCCGAATCG TACCGCTTGA GAACGTTCCT
GCTCGTGACC TGGCCCCCCT GCTCCGCCAG ATGATGGATG CGGGTAGCGT CGGTAATGTT
GTGCATTATG AACCCTCCAA CGTTCTTATT CTGACCGGTC GTGCCTCCAC CATTAATAAA
CTGATTGAAG TCATAAAGCG CGTTGATGTC ATCGGCACAG AGAAGCAGCA AATTATTCAT
CTGGAATATG CGTCAGCGGA AGATCTCGCC GAGATTCTTA ATCAATTAAT CAGCGAAAGC
CACGGTAAAA GCCAGATGCC AGCCCTCCTC TCCGCGAGGA TTGTGGCGGA TAAGCGAACC
AACTCTCTTA TCATCAGTGG ACCGGAAAAA GCACGCCAGC GCATCACTTC ATTACTGAAA
AGCCTTGATG TCGAAGAGAG CGAGGAAGGA AATACCCGGG TTTATTACCT GAAATATGCT
AAAGCCACGA ATCTGGTGGA AGTGCTAACC GGTGTTTCCG AAAAGCTGAA AGATGAAAAA
GGGAATGCGC GTAAGCCCTC CTCTTCTGGC GCGATGGATA ACGTCGCCAT TACCGCCGAT
GAACAGACTA ACTCTCTGGT CATTACCGCT GACCAGTCCG TCCAGGAAAA ACTCGCCACG
GTAATTGCGC GTCTGGACAT TCGCCGTGCA CAGGTGCTGG TTGAGGCAAT CATCGTTGAA
GTTCAGGATG GAAATGGACT AAACCTCGGC GTGCAATGGG CGAATAAAAA CGTTGGCGCA
CAGCAATTTA CCAATACCGG ATTACCGATT TTTAACGCTG CGCAAGGTGT GGCTGATTAT
AAAAAGAATG GTGGGATCAC CAGCGCGAAT CCTGCCTGGG ATATGTTTAG CGCCTACAAT
GGCATGGCCG CAGGCTTCTT CAATGGCGAC TGGGGAGTAC TGCTTACCGC GCTGGCCAGT
AACAATAAAA ATGACATCCT CGCCACCCCA AGCATCGTAA CGCTGGATAA TAAACTCGCG
TCCTTCAACG TGGGGCAGGA TGTGCCGGTG CTATCCGGGT CACAGACCAC TTCAGGGGAT
AACGTCTTTA ATACCGTCGA ACGCAAAACG GTGGGGACAA AACTCAAAGT TACTCCGCAG
GTCAATGAAG GCGACGCGGT GTTGCTCGAA ATAGAGCAGG AAGTCTCCAG CGTTGACTCT
TCCTCTAACT CGACGCTCGG CCCGACGTTT AATACCCGTA CTATTCAAAA CGCCGTGCTG
GTCAAAACCG GTGAAACGGT GGTCCTGGGC GGATTGCTGG ATGATTTTTC TAAAGAGCAA
GTGTCAAAGG TTCCTCTGCT TGGCGATATT CCTTTAGTGG GGCAACTCTT CCGCTATACC
TCCACCGAGC GCGCTAAACG CAACCTGATG GTATTTATCC GTCCGACGAT TATCCGTGAC
GATGATGTTT ATCGCTCACT GTCAAAAGAG AAATACACCC GTTACCTTCA GGAGCAACAA
CAGCGGATCG ACGGGAAATC AAAAGCGCTG GTTGGCTCGG AAGATTTGCC GGTGCTGGAT
GAAAACACGT TCAACAGTCA CGCCCCTGCG CCATCGTCAC GGTGA
 
Protein sequence
MDCVMKGLNK ITCCLLAALL MPCAGHAENE QYGANFNNAD IRQFVEIVGQ HLGKTILIDP 
SVQGTISVRS NDTFSQQEYY QFFLSILDLY GYSVITLDNG FLRVVRSANV KTSPGMIADS
SRPGVGDELV TRIVPLENVP ARDLAPLLRQ MMDAGSVGNV VHYEPSNVLI LTGRASTINK
LIEVIKRVDV IGTEKQQIIH LEYASAEDLA EILNQLISES HGKSQMPALL SARIVADKRT
NSLIISGPEK ARQRITSLLK SLDVEESEEG NTRVYYLKYA KATNLVEVLT GVSEKLKDEK
GNARKPSSSG AMDNVAITAD EQTNSLVITA DQSVQEKLAT VIARLDIRRA QVLVEAIIVE
VQDGNGLNLG VQWANKNVGA QQFTNTGLPI FNAAQGVADY KKNGGITSAN PAWDMFSAYN
GMAAGFFNGD WGVLLTALAS NNKNDILATP SIVTLDNKLA SFNVGQDVPV LSGSQTTSGD
NVFNTVERKT VGTKLKVTPQ VNEGDAVLLE IEQEVSSVDS SSNSTLGPTF NTRTIQNAVL
VKTGETVVLG GLLDDFSKEQ VSKVPLLGDI PLVGQLFRYT STERAKRNLM VFIRPTIIRD
DDVYRSLSKE KYTRYLQEQQ QRIDGKSKAL VGSEDLPVLD ENTFNSHAPA PSSR