Gene EcolC_3527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3527 
Symbol 
ID6065550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3851713 
End bp3852615 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content49% 
IMG OID641602944 
Productputative transposase YhgA family protein 
Protein accessionYP_001726468 
Protein GI170021514 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.653665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCAC CGAGTACCAC ACCGCATGAC GCGGTATTTA AACAATTTTT AATGCATGCG 
GAGACGGCTC GCGACTTTCT GGAGATACAT TTGCCAGTGG AATTACGCGA ACTTTGTGAC
CTCAACACGC TTCATTTAGA GTCGGGGAGT TTCATTGAAG AGAGCCTGAA AGGACACAGC
ACGGACGTGC TCTATTCCGT GCAAATGCAG GGCAATCCCG GTTATCTGCA TGTTGTGATT
GAACACCAAA GCAAGCCGGA TAAGAAAATG GCCTTTCGCA TGATGCGTTA TTCTATAGCC
GCCATGCACC GGCATCTGGA GGCTGACCAC GATAAGCTGC CGCTGGTGGT GCCGATACTG
TTTTATCAGG GCGAGGCCAC ACCTTATCCG CTATCAATGT GCTGGTTTGA TATGTTTTAC
TCGCCGGAGC TGGCGCGACG CGTCTATAAC AGTCCTTTCC CGCTGGTGGA TATCACCATC
ACACCGGATG ACGAAATCAT GCAACATCGG CGGATTGCGA TTCTCGAACT ACTGCAAAAA
CATATTCGCC AGCGCGACTT AATGTTATTG CTTGAGCAAC TGGTCACGCT GATCGACGAA
GGGTACACTA GCGGAAGTCA GTTAGTTGCC ATGCAAAACT ATATGCTGCA ACGCGGTCAT
ACTGAACAAG CGGATTTGTT TTACGGTGTG TTGAGAGACA GGGAAACGGG AGGGGAGTCT
ATGATGACGC TGGCGCAGTG GTTTGAAGAG AAAGGGATTG AGAAGGGGAT TCAGCAGGGA
AGACAGGAAG TAAGTCAGGA ATTCGCCCAG CGTCTTCTGA GTAAAGGAAT GTCTCGGGAA
GACGTTGCAG AGATGGCAAA TTTACCTCTT GCTGAGATTG ATAAGGTAAT TAACCTTATT
TAA
 
Protein sequence
MDAPSTTPHD AVFKQFLMHA ETARDFLEIH LPVELRELCD LNTLHLESGS FIEESLKGHS 
TDVLYSVQMQ GNPGYLHVVI EHQSKPDKKM AFRMMRYSIA AMHRHLEADH DKLPLVVPIL
FYQGEATPYP LSMCWFDMFY SPELARRVYN SPFPLVDITI TPDDEIMQHR RIAILELLQK
HIRQRDLMLL LEQLVTLIDE GYTSGSQLVA MQNYMLQRGH TEQADLFYGV LRDRETGGES
MMTLAQWFEE KGIEKGIQQG RQEVSQEFAQ RLLSKGMSRE DVAEMANLPL AEIDKVINLI