Gene EcolC_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4081 
Symbol 
ID6065604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4505774 
End bp4507915 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content56% 
IMG OID641603503 
ProductYD repeat-containing protein 
Protein accessionYP_001727006 
Protein GI170022052 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT 
CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC
GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT
GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT
TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG
GATATCCGCT TACAGCTGCG CGATAATACA CTGATACTCA GTGATAACGG CGGCAGAAGC
CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG
CTGGTGCGCG GCGGCGTCCT GAGACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG
GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG
GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT
GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG
ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT
GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG
CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC
ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG
GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTACGCT ATGGCTGGAC GCCGCGCGGC
GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT
AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT
TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT
CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTCCTGCAT
ACCGCAGGCG AAGGCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC
ACGCAGAGTC AGTTTGACGC GGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG
ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT
GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT
GACGGGCTGG AAATACGCCG GGAATACGAT GAATGGGGTA ACCAGCTGAA TGAAGAGAAC
CCGCACCAGC TGCAGCAGCT CATCCGCCTG CCGGGGCAGC AGTATGATGA GGAGTCCGGC
CTGTATTATA ACCGCCACCG CTATTATGAC CCGCTGCGGG GGAGGTATAT CACTCAGGAT
CCGATTGGGC TGAAGGGGGG ATGGAACCTG TATACATATC CGCTGAGCCC GGTGAATAGC
ATGGATCCAT TAGGATTATA TGAATTTAAA TCAAAAAATA TAGATGATAT TGGAATATTT
GCATTGGCAA TGTGTAATGG AGAATCAATT AACGAGAATA AAGAATATGG TGGACTAATA
TGTAAGAAGC AAGGTGAATA TTTCCCCATG AATCCGATAA GTTCAAATGA TAATGATAGT
GTAGACTTGC GAAATATAAA ATGCCCTGAA GGTTCAGAGA GAGTAGGCGA TTATCACACT
CACGGTTTTT ACTCTGACGA TAAAGGAAAT AAAGTAACAA AAGAAAATGA TGTTTATGAT
AGTCTAAATT TTTCAAGCAA AGATTTAACG AATTCTTATA TGAATGGAAT GGGAAAAAAA
GAATACAGTA GTTACTTGGG AACACCAAAT GACACCTATC TAAAATATAA TCCCAAAGCT
AAAGGGAATG GAGTTACAAT TATCAGGCAA GGGAGTAATT AA
 
Protein sequence
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG 
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVLRLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG
AGRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TAGEGGLKRV VKKEHADGSV
TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP
DGLEIRREYD EWGNQLNEEN PHQLQQLIRL PGQQYDEESG LYYNRHRYYD PLRGRYITQD
PIGLKGGWNL YTYPLSPVNS MDPLGLYEFK SKNIDDIGIF ALAMCNGESI NENKEYGGLI
CKKQGEYFPM NPISSNDNDS VDLRNIKCPE GSERVGDYHT HGFYSDDKGN KVTKENDVYD
SLNFSSKDLT NSYMNGMGKK EYSSYLGTPN DTYLKYNPKA KGNGVTIIRQ GSN