Gene Information |
Locus tag | EcolC_4081 |
Symbol | |
ID | 6065604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4505774 |
End bp | 4507915 |
Gene Length | 2142 bp |
Protein Length | 713 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603503 |
Product | YD repeat-containing protein |
Protein accession | YP_001727006 |
Protein GI | 170022052 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
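
The coordinates, gene length, and protein length listed above agree with one another, which a little arithmetic confirms. The sketch below is an illustration only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 4505774
end_bp = 4507915

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2142 713
```

Under these assumptions the result matches the listed Gene Length (2142 bp) and Protein Length (713 aa).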
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG GATATCCGCT TACAGCTGCG CGATAATACA CTGATACTCA GTGATAACGG CGGCAGAAGC CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG CTGGTGCGCG GCGGCGTCCT GAGACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTACGCT ATGGCTGGAC GCCGCGCGGC GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTCCTGCAT ACCGCAGGCG AAGGCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC ACGCAGAGTC AGTTTGACGC GGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT GACGGGCTGG AAATACGCCG GGAATACGAT GAATGGGGTA ACCAGCTGAA TGAAGAGAAC CCGCACCAGC TGCAGCAGCT CATCCGCCTG CCGGGGCAGC AGTATGATGA GGAGTCCGGC CTGTATTATA ACCGCCACCG CTATTATGAC CCGCTGCGGG GGAGGTATAT CACTCAGGAT CCGATTGGGC TGAAGGGGGG ATGGAACCTG TATACATATC CGCTGAGCCC GGTGAATAGC ATGGATCCAT TAGGATTATA TGAATTTAAA TCAAAAAATA TAGATGATAT TGGAATATTT GCATTGGCAA TGTGTAATGG AGAATCAATT AACGAGAATA AAGAATATGG TGGACTAATA 
TGTAAGAAGC AAGGTGAATA TTTCCCCATG AATCCGATAA GTTCAAATGA TAATGATAGT GTAGACTTGC GAAATATAAA ATGCCCTGAA GGTTCAGAGA GAGTAGGCGA TTATCACACT CACGGTTTTT ACTCTGACGA TAAAGGAAAT AAAGTAACAA AAGAAAATGA TGTTTATGAT AGTCTAAATT TTTCAAGCAA AGATTTAACG AATTCTTATA TGAATGGAAT GGGAAAAAAA GAATACAGTA GTTACTTGGG AACACCAAAT GACACCTATC TAAAATATAA TCCCAAAGCT AAAGGGAATG GAGTTACAAT TATCAGGCAA GGGAGTAATT AA
|
Protein sequence | MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS LYFEHLFPGE DGYSRSESLW LVRGGVLRLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG AGRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TAGEGGLKRV VKKEHADGSV TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP DGLEIRREYD EWGNQLNEEN PHQLQQLIRL PGQQYDEESG LYYNRHRYYD PLRGRYITQD PIGLKGGWNL YTYPLSPVNS MDPLGLYEFK SKNIDDIGIF ALAMCNGESI NENKEYGGLI CKKQGEYFPM NPISSNDNDS VDLRNIKCPE GSERVGDYHT HGFYSDDKGN KVTKENDVYD SLNFSSKDLT NSYMNGMGKK EYSSYLGTPN DTYLKYNPKA KGNGVTIIRQ GSN
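
The relationship between the two sequences above can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the amino acid assignments of translation table 11 are the same as those of the standard code. `translate` and the other names are hypothetical helpers, and only the first three and the last two 10-base blocks of the record are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 assigns the same amino acids as the
# standard code (it differs in which codons may act as start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks and last twelve bases of the gene sequence above.
first_blocks = "ATGAGCGGAA AACCGGCAGC GCGTCAGGGC"
last_bases = "GGGAGTAATT AA"

print(translate(first_blocks))  # MSGKPAARQG
print(translate(last_bases))    # GSN
```

Both results match the first ten and the last three residues of the listed protein sequence, which is consistent with the gene sequence being given 5' to 3' on the coding strand.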