Gene EcolC_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1400 
Symbol 
ID6067885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1532887 
End bp1534089 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID641600820 
Productcompetence damage-inducible protein A 
Protein accessionYP_001724391 
Protein GI170019437 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.105485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAG TGGAAATGTT ATCCACCGGG GATGAAGTGT TACACGGGCA AATCGTTGAC 
ACTAACGCTG CCTGGCTGGC CGATTTTTTC TTTCATCAGG GGTTGCCATT ATCTCGCCGC
AATACGGTGG GGGATAACCT TGATGACTTA GTCACCATTC TTCGCGAACG TAGTCAGCAC
GCCGATGTGC TGATCGTTAA CGGCGGGCTG GGACCGACCA GCGATGATTT AAGCGCACTC
GCCGCTGCGA CAGCAAAAGG TGAAGGCCTG GTGCTGCATG AAGCCTGGCT CAAAGAGATG
GAACGCTATT TCCACGAACG TGGACGAGTA ATGGCACCGA GCAACCGTAA ACAAGCGGAG
CTGCCTGCCA GTGCTGAATT TATCAATAAC CCGGTAGGCA CCGCCTGTGG TTTTGCCGTG
CAGCTTAATC GTTGCCTGAT GTTCTTTACT CCCGGCGTAC CGTCAGAATT TAAGGTGATG
GTCGAGCACG AAATCCTGCC GCGCCTGCGC GAGCGTTTTT CTTTACCGCA GCCGCCGGTT
TGTCTGCGTT TGACTACTTT TGGTCGTTCG GAAAGCGATC TGGCACAAAG CCTGGACACT
CTACAACTGC CGCCGGGCGT AACAATGGGC TATCGCTCCT CAATGCCTAT CATCGAACTG
AAACTCACCG GACCGGCAAG CGAGCAACAG GCGATGGAAA AACTGTGGCT GGATGTTAAA
CGTGTTGCCG GACAGAGCGT GATTTTCGAA GGCACTGAAG GACTGCCCGC GCAGATCAGT
CGCGAATTGC AAAACCGCCA GTTCAGCCTG ACGTTGAGCG AGCAATTCAC CGGTGGTTTA
TTGGCTTTGC AACTTTCTCG CGCAGGTGCT CCATTGCTGG CGTGTGAAGT GGTTCCTTCA
CAGGAGGAAA CCCTGGCGCA AACTGCGCAC TGGATTACAG AACGGCGGGC CAACCATTTT
GCCGGGCTGG CACTGGCTGT TTCGGGTTTC GAGAACGAGC ATCTCAACTT TGCGCTAGCC
ACGCCAGACG GCACTTTCGC TCTGCGTGTG CGTTTCAGCA CTACGCGCTA CAGCCTGGCT
ATCCGTCAGG AAGTGTGCGC AATGATGGCA CTGAATATGC TGCGCCGTTG GTTAAACGGC
CAGGATATCG CCAGTGAGCA TGGCTGGATT GAGGTTGTTG AGTCCATGAC CTTATCTGTC
TGA
 
Protein sequence
MLKVEMLSTG DEVLHGQIVD TNAAWLADFF FHQGLPLSRR NTVGDNLDDL VTILRERSQH 
ADVLIVNGGL GPTSDDLSAL AAATAKGEGL VLHEAWLKEM ERYFHERGRV MAPSNRKQAE
LPASAEFINN PVGTACGFAV QLNRCLMFFT PGVPSEFKVM VEHEILPRLR ERFSLPQPPV
CLRLTTFGRS ESDLAQSLDT LQLPPGVTMG YRSSMPIIEL KLTGPASEQQ AMEKLWLDVK
RVAGQSVIFE GTEGLPAQIS RELQNRQFSL TLSEQFTGGL LALQLSRAGA PLLACEVVPS
QEETLAQTAH WITERRANHF AGLALAVSGF ENEHLNFALA TPDGTFALRV RFSTTRYSLA
IRQEVCAMMA LNMLRRWLNG QDIASEHGWI EVVESMTLSV