Gene EcolC_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3961 
Symbol 
ID6064488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4351067 
End bp4352068 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content34% 
IMG OID641603374 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001726889 
Protein GI170021935 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0697467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATAA TAAATGTGAC GATAGAATTA CGAGAAGAGT TTACCTTAGT AAAGAAAAGG 
AAGTATCCAT TAAGGTGGTA TATTATTATA AATAATGTCG CCGTCCATAA TAATACTATC
GAAATTCCAC AGACAGTAAA TGGTGGTTAC GATTTTTCAC ACCTTAGCCT GAAAGGTATC
GTGATTAAAG ATGAAGATTT ATCCAATTCG AATTTTGCAG GTTGCAGACT ACAAAACGCT
ATTTTCCAGG ACTGTAATAT GTATAAAACG AATTTTTATT ACGCCATAAT GGAAAAAATA
CTTTTTGATA ATTGTATTCT CGATGACTCA AATTTCGCTC AGATAAAAAT GGCCGACGGA
ACTCTAAATG CATGCTCCGC TATGCATGTT CAATTCTACA ATGCAGCAAT GAATAGAGCC
AATATTAAAA ATACCTTTCT TGACTATTCA AATTTTTATA TGGCGTACAT GGCTGAGGTA
AATCTTTATA AAGTAATAGC GCCATATGTT AATTTATTTA AAGCCGACCT TAGTTTCTCT
AAACTCGATT TAATTAACTT TGAACATGCT GATCTGTCTC GCGTCAATCT GAACAAAGCA
ATCCTCCAGA ATATAAACTT AATTGATAGC AAACTCTTTT GTACGTGGCT AACAAATACA
TTCCTCGAAA TGGTTATATG TACCGACTCT AATATGGCTA ATGTTAATTT TAATAATGCC
AATTTAAGCA ATTGCCATTT CAACTGTTCT GTTTTAACAA AAGCCTGGAT GTTTAATATC
CGTCTCTATC GTGTTAATTT CGATGAGGCT AGCGTCCAGG GAATGGGTAT TACCATTCTC
CGTGGTGAGG AAAATATCTC CATTAATAGT GATACCCTGG TAACACTACA GAAATTCTTT
GAAGAAGATT GTACCTCTCA TACTGGCATG TCACAAACTG AGGATAATAT TAATGCAGTC
GCTATGAAGA TTACTGCAGA TATTATGCAA CACGCAGATT GA
 
Protein sequence
MQIINVTIEL REEFTLVKKR KYPLRWYIII NNVAVHNNTI EIPQTVNGGY DFSHLSLKGI 
VIKDEDLSNS NFAGCRLQNA IFQDCNMYKT NFYYAIMEKI LFDNCILDDS NFAQIKMADG
TLNACSAMHV QFYNAAMNRA NIKNTFLDYS NFYMAYMAEV NLYKVIAPYV NLFKADLSFS
KLDLINFEHA DLSRVNLNKA ILQNINLIDS KLFCTWLTNT FLEMVICTDS NMANVNFNNA
NLSNCHFNCS VLTKAWMFNI RLYRVNFDEA SVQGMGITIL RGEENISINS DTLVTLQKFF
EEDCTSHTGM SQTEDNINAV AMKITADIMQ HAD