Gene EcolC_2424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2424 
Symbol 
ID6068472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2669655 
End bp2672522 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content50% 
IMG OID641601833 
Productouter membrane autotransporter 
Protein accessionYP_001725385 
Protein GI170020431 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.33795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCA AACAACACAA TGGGAATACC AAAGCAGATC GTCTCGCTGA ATTAAAAATC 
CGTTCGCCCT CAATTCAACT GATAAAATTT GGCGCTATTG GTTTGAATGC AATTATATTT
TCCCCCCTGC TGATAGCTGC TGATACAGGA AGTCAATATG GCACCAATAT TACTATTAAT
GATGGTGACA GAATTACAGG AGATACCGCC GATCCATCAG GAAACCTCTA TAGTGTAATG
ACCCCAGCAG GAAACACGCC TGGCAATATC AACCTGGGTA ATGATGTCAC CGTCAATGTC
AACGACGCCT CTGGATATGC AAAAGGAATC ATTATTCAGG GCAAAAACAG CTCCCTGACA
GCTAACCGAC TCACAGTAGA TGTTGTTGGT CAAACCTCTG CCATCGGCAT TAACTTAATT
GGTGACTATA CCCATGCTGA CTTAGGCACA GGCAGCACCA TTAAGAGTAA CGATGACGGC
ATCATTATTG GGCATAGCTC AACACTAACA GCCACTCAAT TCACCATTGA AAACTCGAAC
GGTATAGGCC TAACCATCAA TGACTATGGC ACCAGTGTCG ATCTTGGAAG CGGAAGTAAA
ATCACGACCG ATGGAAGTAC AGGTGTTTAT ATCGGTGGTC TCAACGGCAA TAACGCCAAT
GGTGCTGCGC GTTTTACGGC TACAGACCTG ACAATCGATG TTCAGGGCTA CAGCGCCATG
GGGATAAACG TACAGAAAAA CTCTGTTGTC GATCTCGGAA CAAACAGTAC CATTAAAACC
AATGGCGATA ATGCTCACGG CCTCTGGAGC TTTGGCCAGG TTAGCGCGAA TGCACTCACT
GTTGATGTAA CTGGAGCCGC GGCCAATGGC GTCGAAGTTC GTGGTGGTAC AACCACTATC
GGTGCAGATA GCCATATTTC TTCCGCGCAG GGCGGTGGCC TCGTCACCAG TGGTTCAGAC
GCGATAATCA ATTTTACTGG CACGGCAGCG CAACGAAACA GCATCTTTTC CGGCGGTTCT
TATGGTGCCT CGGCCCAGAC GGCAACGGCT GTTGTCAACA TGCAAAATAC CGATATTACA
GTTGATCGTA ATGGCAGTCT GGCGCTGGGT TTGTGGGCTC TCAGCGGCGG TAGAATAACC
GGAGACAGTT TGGCTATCAC CGGCGCGGCA GGAGCCAGGG GAATTTATGC CATGACCAAC
AGCCAGATCG ACCTCACGAG CGATCTGGTC ATTGATATGA GTACACCCGA CCAGATGGCC
ATCGCAACGC AACATGACGA TGGTTATGCC GCCAGCCGCA TCAACGCCTC GGGTCGTATG
CTTATCAACG GTAGCGTTCT TTCCAAAGGT GGGCTAATCA ATCTGGATAT GCACCCTGGG
TCGGTTTGGA CAGGTTCCTC CCTCAGCGAT AATGTCAATG GCGGAAAACT GGACGTTGCA
ATGAATAACA GCGTCTGGAA CGTAACAAGT AATTCTAATC TCGACACGCT GGCGCTGAGC
CATTCAACTG TCGATTTTGC CAGCCACGGG TCAACTGCCG GCACATTTGC CACATTAAAC
GTAGAGAACC TGAGCGGTAA CAGTACCTTT ATTATGCGTG CTGATGTTGT TGGCGAGGGT
AATGGCGTTA ATAATAAAGG GGATTTATTG AATATCAGCG GGAGTAGTGC TGGTAATCAC
GTATTGGCTA TCCGCAACCA GGGCAGCGAG GCCACAACGG GAAATGAAGT TCTGACAGTG
GTAAAAACCA CTGACGGCGC GGCCTCGTTC AGCGCGTCTT CTCAGGTTGA GTTGGGGGGA
TATCTGTACG ATGTGCGTAA AAATGGCACT AACTGGGAGC TTTACGCTTC CGGGACAGTT
CCGGAACCGA CTCCTAATCC TGAACCCACA CCAGCTCCCG CTCAGCCTCC CATAGTCAAC
CCCGATCCTA CGCCTGAACC CGCTCCCACG CCTAAACCCA CCACGACCGC AGATGCTGGC
GGCAATTATC TCAATGTCGG TTACTTATTG AACTATGTTG AAAACCGTAC GCTGATGCAA
CGGATGGGTG ACCTGCGAAA TCAGAGTAAA GACGGTAATA TCTGGTTGCG CAGTTATGGG
GGAAGCCTGG ACTCCTTTGC CAGTGGCAAA CTGAGCGGCT TTGACATGGG TTACAGCGGT
ATCCAGTTTG GTGGGGATAA ACGTCTCTCT GATGTAATGC CGTTGTATGT CGGTCTGTAT
ATTGGCTCAA CACATGCATC GCCGGACTAT AGCGGAGGCG ACGGTACCGC ACGTTCAGAC
TACATGGGAA TGTACGCCAG TTACATGGCA CAAAACGGTT TTTACAGCGA TCTCGTTATA
AAAGCATCGC GCCAGAAAAA TAGTTTCCAC GTACTGGACA GTCAGAACAA CGGCGTTAAC
GCCAACGGCA CTGCGAATGG AATGAGCATC TCCCTGGAAG CCGGGCAGAG GTTCAACCTG
TCCCCTACTG GTTATGGGTT CTATATAGAG CCGAAAACCC AGCTTACATA CAGCCACCAG
AATGAGATGA CTATGAAGGC GAGTAATGGC CTCAATATAC ATCTGAATCA CTACGAATCG
CTGCTGGGGC GTGCCAGCAT GATACTGGGG TATGACATCA CCGCAGGCAA CAGCCAGCTG
AATGTCTATG TGAAGACTGG CGCTATCCGC GAGTTTTCAG GGGATACCGA ATATCTGTTG
AACAACTCCC GGGAGAAGTA CAGTTTCAAA GGTAATGGCT GGAATAACGG CGTGGGAGTC
AGTGCACAGT ATAACAAACA GCACACATTC TATCTCGAAG CGGATTACAC GCAGGGTAAC
CTCTTTGATC AGAAGCAAGT CAACGGAGGA TATCGCTTCA GCTTTTAA
 
Protein sequence
MGIKQHNGNT KADRLAELKI RSPSIQLIKF GAIGLNAIIF SPLLIAADTG SQYGTNITIN 
DGDRITGDTA DPSGNLYSVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT
ANRLTVDVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN
GIGLTINDYG TSVDLGSGSK ITTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM
GINVQKNSVV DLGTNSTIKT NGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI
GADSHISSAQ GGGLVTSGSD AIINFTGTAA QRNSIFSGGS YGASAQTATA VVNMQNTDIT
VDRNGSLALG LWALSGGRIT GDSLAITGAA GARGIYAMTN SQIDLTSDLV IDMSTPDQMA
IATQHDDGYA ASRINASGRM LINGSVLSKG GLINLDMHPG SVWTGSSLSD NVNGGKLDVA
MNNSVWNVTS NSNLDTLALS HSTVDFASHG STAGTFATLN VENLSGNSTF IMRADVVGEG
NGVNNKGDLL NISGSSAGNH VLAIRNQGSE ATTGNEVLTV VKTTDGAASF SASSQVELGG
YLYDVRKNGT NWELYASGTV PEPTPNPEPT PAPAQPPIVN PDPTPEPAPT PKPTTTADAG
GNYLNVGYLL NYVENRTLMQ RMGDLRNQSK DGNIWLRSYG GSLDSFASGK LSGFDMGYSG
IQFGGDKRLS DVMPLYVGLY IGSTHASPDY SGGDGTARSD YMGMYASYMA QNGFYSDLVI
KASRQKNSFH VLDSQNNGVN ANGTANGMSI SLEAGQRFNL SPTGYGFYIE PKTQLTYSHQ
NEMTMKASNG LNIHLNHYES LLGRASMILG YDITAGNSQL NVYVKTGAIR EFSGDTEYLL
NNSREKYSFK GNGWNNGVGV SAQYNKQHTF YLEADYTQGN LFDQKQVNGG YRFSF