Gene ECD_01021 details

Gene Information

Locus tag: ECD_01021
Symbol: ycdB
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1087659
End bp: 1088930
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: hypothetical protein
Protein accession: ACT42916
Protein GI: 253977246
COG category:
COG ID:
TIGRFAM ID:
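
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 1087659
end_bp = 1088930
gene_length_bp = 1272
protein_length_aa = 423


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above match the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the reported values agree: the span covers 1272 bp, which is 424 codons, or 423 residues plus a stop codon.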


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.211947
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTATA AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGTG 
ATAGGTGCAC TGGCGCTGGC GGGAAGTTGT CCGGTCGCTC ATGCACAAAA AACGCAAAGT
GCGCCGGGTA CGCTTTCACC GGATGCTCGC AATGAGAAAC AGCCGTTTTA TGGTGAGCAT
CAGGCAGGGA TCCTGACGCC ACAACAGGCC GCAATGATGC TGGTGGCGTT TGATGTGCTT
GCCAGCGATA AAGCCGATCT TGAGCGGTTG TTTCGCTTGT TGACTCAGCG TTTTGCTTTT
CTGACTCAGG GCGGAGCAGC ACCAGAAACG CCAAATCCGC GCCTGCCACC ACTCGATTCC
GGCATTCTTG GCGGCTACAT TGCGCCCGAT AATCTCACCA TCACGTTATC GGTGGGTCAC
TCATTGTTTG ATGAGCGCTT TGGCCTTGCG CCACAGATGC CAAAAAAGCT GCAGAAGATG
ACGCGTTTCC CCAACGACTC GCTGGATGCG GCGTTATGTC ATGGTGATGT GTTGCTACAG
ATTTGCGCCA ACACCCAGGA CACGGTTATC CATGCGCTGC GCGATATCAT CAAACACACG
CCGGATTTGC TCAGTGTGCG CTGGAAGCGG GAAGGGTTTA TTTCCGATCA CGCGGCGCGT
AGTAAAGGCA AAGAGACGCC GATTAATTTG CTGGGTTTCA AAGACGGCAC TGCCAATCCC
GATAGCCAGA ATGATAAGTT GATGCAAAAA GTGGTGTGGG TAACGGCAGA TCAGCAGGAG
CCTGCGTGGA CAATCGGTGG CAGCTATCAG GCAGTACGCT TGATTCAGTT TCGAGTGGAA
TTTTGGGACA GAACGCCGCT GAAAGAACAG CAGACGATTT TTGGCCGTGA TAAGCAAACC
GGTGCGCCGC TGGGAATGCA GCATGAGCAT GATGTGCCTG ATTACGCCAG CGACCCGGAA
GGGAAGGTGA TCGCGCTGGA CAGCCATATC CGGCTGGCGA ATCCCCGCAC GGCGGAGAGT
GAGTCCAGCC TGATGCTGCG TCGTGGCTAC AGTTATTCAC TGGGCGTCAC CAACTCCGGG
CAACTGGATA TGGGATTGCT GTTTGTCTGC TACCAACACG ATCTGGAAAA AGGCTTCCTG
ACAGTACAAA AAAGGCTCAA TGGCGAAGCG CTGGAGGAAT ACGTTAAACC TATCGGCGGC
GGTTATTTTT TTGCGCTGCC GGGGGTGAAG GACGCGAACG ATTATTTCGG AAGCGCGTTA
TTGCGGGTTT AA
 
Protein sequence
MQYKDENGVN EPSRRRLLKV IGALALAGSC PVAHAQKTQS APGTLSPDAR NEKQPFYGEH 
QAGILTPQQA AMMLVAFDVL ASDKADLERL FRLLTQRFAF LTQGGAAPET PNPRLPPLDS
GILGGYIAPD NLTITLSVGH SLFDERFGLA PQMPKKLQKM TRFPNDSLDA ALCHGDVLLQ
ICANTQDTVI HALRDIIKHT PDLLSVRWKR EGFISDHAAR SKGKETPINL LGFKDGTANP
DSQNDKLMQK VVWVTADQQE PAWTIGGSYQ AVRLIQFRVE FWDRTPLKEQ QTIFGRDKQT
GAPLGMQHEH DVPDYASDPE GKVIALDSHI RLANPRTAES ESSLMLRRGY SYSLGVTNSG
QLDMGLLFVC YQHDLEKGFL TVQKRLNGEA LEEYVKPIGG GYFFALPGVK DANDYFGSAL
LRV
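
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in code. This is an illustrative sketch only, not the method used to produce the record. It assumes the sequences are pasted as plain strings with the space-separated 10-base blocks shown above, and it uses the standard codon assignments, which translation table 11 shares for all 64 codons (table 11 differs in the start codons it permits, which this sketch ignores). The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
dna_prefix = "ATGCAGTATA AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGTG"
protein_prefix = "MQYKDENGVN EPSRRRLLKV"
print(translate(dna_prefix) == clean(protein_prefix))
```

Passing the full gene sequence to `translate` and `gc_content` would allow the whole record to be compared against the listed protein sequence and GC content in the same way.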