Gene ECD_02838 details

Gene Information

Locus tag: ECD_02838
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2970281
End bp: 2972341
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: GspD, hypothetical type II secretion protein
Protein accession: ACT44642
Protein GI: 253978972
COG category:
COG ID:
TIGRFAM ID:
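
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and a single stop codon is assumed to follow the last amino-acid codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2970281
end_bp = 2972341
gene_length_bp = 2061
protein_length_aa = 686

# Assumption: start and end coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino-acid codon.
coding_bp = 3 * (protein_length_aa + 1)

print(span_bp, coding_bp, gene_length_bp)
```

Under those assumptions, both derived values agree with the listed gene length.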


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000664316
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTTGGC GTGATATTAC GTTGTCGGTC TGGCGTAAGA AGACAACTGG CCTCAAAACA 
AAAAAGCGTT TACTGCCGCT GGTGCTGGCA GCGGCATTAT GCAGTTCACC GGTTTGGGCG
GAAGAAGCCA CTTTCACCGC CAATTTTAAA GATACCGACC TGAAATCGTT CATCGAAACC
GTCGGCGCTA ACCTTAATAA AACCATCATT ATGGGGCCGG GCGTACAGGG GAAAGTGAGT
ATTCGTACCA TGACTCCGCT CAATGAACGC CAGTATTACC AGCTATTCCT TAACCTGCTG
GAAGCACAGG GGTATGCCGT CGTACCGATG GAAAACGACG TGCTGAAGGT GGTGAAATCC
AGCGCCGCGA AAGTCGAGCC ACTGCCGCTG GTCGGTGAAG GCAGCGACAA CTACGCGGGC
GATGAAATGG TCACCAAAGT AGTGCCGGTA CGTAATGTGT CGGTCCGCGA ATTGGCACCG
ATTCTGCGCC AGATGATCGA CAGCGCAGGC TCAGGCAACG TTGTTAATTA CGATCCCTCC
AACGTGATTA TGCTCACCGG ACGCGCCTCC GTTGTGGAGC GCCTGACGGA AGTGATCCAG
CGTGTGGATC ACGCGGGTAA TCGCACTGAA GAGGTGATCC CGCTGGATAA CGCCTCGGCT
TCGGAAATCG CCCGCGTGCT GGAAAGCCTG ACCAAAAACA GCGGCGAGAA CCAGCCAGCA
ACGCTGAAAT CTCAAATTGT CGCCGATGAA CGCACCAACA GCGTGATTGT CAGTGGTGAC
CCCGCCACGC GGGACAAAAT GCGCCGTCTG ATCCGTCGCC TGGACTCAGA AATGGAGCGC
AGCGGCAACA GCCAGGTTTT CTATCTCAAA TACAGCAAAG CCGAAGATCT GGTCGATGTA
CTGAAGCAGG TCAGCGGTAC GCTCACGGCG GCTAAAGAAG AGGCGGAAGG CACGGTCGGT
AGCGGGCGTG AGGTTGTCTC CATCGCCGCC AGCAAACACA GTAATGCCCT GATTGTTACC
GCGCCGCAGG ACATCATGCA GTCGCTGCAA AGCGTGATTG AACAACTGGA TATTCGCCGT
GCTCAGGTGC ATGTCGAGGC GTTGATCGTG GAAGTTGCTG AAGGCAGCAA TATCAACTTC
GGCGTGCAGT GGGCGTCGAA AGATGCCGGA TTAATGCAGT TTGCTAACGG TACGCAGATC
CCTATTGGCA CGCTGGGGGC AGCCATTTCT CAGGCGAAAC CGCAGAAAGG CTCGACGGTA
ATCAGTGAAA ACGGCGCTAC CACCATAAAT CCGGATACTA ACGGCGATCT CTCCACGCTT
GCTCAGCTTC TTTCCGGCTT TAGCGGTACG GCGGTTGGTG TGGTGAAAGG CGACTGGATG
GCGCTGGTGC AGGCGGTTAA AAACGACTCC AGCTCTAACG TGCTCTCCAC GCCGAGCATC
ACCACGCTGG ACAACCAGGA AGCCTTCTTC ATGGTGGGCC AGGACGTTCC GGTATTAACT
GGATCTACCG TTGGCTCCAA TAACAGCAAT CCTTTCAATA CGGTAGAGAG GAAAAAAGTC
GGCATCATGC TGAAAGTCAC GCCGCAGATT AACGAAGGAA ACGCGGTACA GATGGTGATT
GAGCAGGAAG TCTCGAAGGT GGAAGGACAG ACCAGCCTCG ACGTCGTGTT TGGTGAGCGC
AAACTGAAGA CCACCGTGCT GGCTAACGAT GGCGAGCTGA TTGTGCTTGG CGGTCTGATG
GATGATCAGG CAGGAGAAAG CGTGGCGAAA GTGCCGCTGC TGGGCGATAT CCCGTTGATT
GGTAACCTGT TTAAATCGAC GGCGGATAAA AAAGAAAAAC GTAACCTGAT GGTGTTTATC
CGCCCGACCA TTCTGCGTGA CGGTATGGCG GCAGACGGCG TGTCGCAGCG CAAATATAAC
TACATGCGCG CCGAGCAAAT CTACCGCGAT GAGCAAGGCT TAAGCCTGAT GCCGCACACC
GCGCAGCCGG TACTGCCAGC GCAAAACCAG GCCTTACCGC CGGAAGTTCG CGCATTCCTC
AATGCCGGGA GAACGCGTTA A
 
Protein sequence
MFWRDITLSV WRKKTTGLKT KKRLLPLVLA AALCSSPVWA EEATFTANFK DTDLKSFIET 
VGANLNKTII MGPGVQGKVS IRTMTPLNER QYYQLFLNLL EAQGYAVVPM ENDVLKVVKS
SAAKVEPLPL VGEGSDNYAG DEMVTKVVPV RNVSVRELAP ILRQMIDSAG SGNVVNYDPS
NVIMLTGRAS VVERLTEVIQ RVDHAGNRTE EVIPLDNASA SEIARVLESL TKNSGENQPA
TLKSQIVADE RTNSVIVSGD PATRDKMRRL IRRLDSEMER SGNSQVFYLK YSKAEDLVDV
LKQVSGTLTA AKEEAEGTVG SGREVVSIAA SKHSNALIVT APQDIMQSLQ SVIEQLDIRR
AQVHVEALIV EVAEGSNINF GVQWASKDAG LMQFANGTQI PIGTLGAAIS QAKPQKGSTV
ISENGATTIN PDTNGDLSTL AQLLSGFSGT AVGVVKGDWM ALVQAVKNDS SSNVLSTPSI
TTLDNQEAFF MVGQDVPVLT GSTVGSNNSN PFNTVERKKV GIMLKVTPQI NEGNAVQMVI
EQEVSKVEGQ TSLDVVFGER KLKTTVLAND GELIVLGGLM DDQAGESVAK VPLLGDIPLI
GNLFKSTADK KEKRNLMVFI RPTILRDGMA ADGVSQRKYN YMRAEQIYRD EQGLSLMPHT
AQPVLPAQNQ ALPPEVRAFL NAGRTR
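
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, the codon assignments are those of the standard code (which translation table 11 shares), and the first codon is simply assumed to be an initiator read as methionine, which is how the leading GTG here corresponds to M.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Simplifying assumption: the first codon is an initiator codon and is
    always reported as M, even when it is GTG or TTG.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_block = "GTGTTTTGGC GTGATATTAC GTTGTCGGTC TGGCGTAAGA AGACAACTGG CCTCAAAACA"
print(translate(first_block))  # MFWRDITLSVWRKKTTGLKT
```

The output matches the first 20 residues of the protein sequence listed in this record.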