Gene ECD_02907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02907 
SymboltolC 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3045552 
End bp3047033 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content51% 
IMG OID 
Productouter membrane channel precursor protein 
Protein accessionACT44711 
Protein GI253979041 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGGTTCAG TTCGTTGAGC 
CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT
AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA
CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC
GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG
TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCAG GGATTCAGGA CGTCACGTAT
CAGACCGATC AGCAAACCTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT
GCTATTGACG TTCTTTCCTA TACACAGGCA CAAAAAGAAG CGATCTACCG TCAATTAGAT
CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CCGACGTGCA GAACGCCCGC
GCACAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG
GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCTGC GCTGAATGTC
GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACGCGC TGCTGAAAGA AGCCGAAAAA
CGCAACCTGT CGCTGTTACA GGCACGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC
CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTACCGG GATTTCTGAC
ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT
ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT
AACTCGCAGG TGAAACAGGC ACAGTACAAC TTTGTCGGTG CCAGCGAGCA ACTGGAAAGT
GCCCATCGTA GCGTCGTGCA GACCGTGCGT TCCTCCTTCA ACAACATTAA TGCATCTATC
AGTAGCATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG
GAAGCGGGCT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG
TTGTACAACG CCAAGCAAGA GCTGGCGAAT GCGCGTTATA ACTACCTGAT TAATCAGCTG
AATATTAAGT CAGCTCTGGG TACGTTGAAC GAGCAGGATC TGCTGGCACT GAACAATGCG
CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCAC CGCAAACGCC GGAACAGAAT
GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCAG TCGTTCAGCA AACATCCGCA
CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
 
Protein sequence
MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL 
LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY
QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR
AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNALLKEAEK
RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASTGISD TSYSGSKTRG AAGTQYDDSN
MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI
SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL
NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA
RTTTSNGHNP FRN