Gene Information |
Locus tag | Ent638_3385 |
Symbol | |
ID | 5112370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3689287 |
End bp | 3692205 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640493590 |
Product | outer membrane autotransporter |
Protein accession | YP_001178096 |
Protein GI | 146313022 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
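
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the terminal stop codon. The record does not state either convention, but both agree with the values shown. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table.
length = gene_length_bp(3689287, 3692205)
print(length)                     # 2919
print(protein_length_aa(length))  # 972
```

Both results match the Gene Length (2919 bp) and Protein Length (972 aa) rows.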
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.308218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA AATATCTCAG CCAGCTGATC TCCTTGCTGG TAGCTTCGAC GGCAGCACAG GGGCTGCTGA CCACCCATGC TTTAGCAGTT TCTGGTACAG TTATCGACTC AGCATTCACC AGCATTAATG TTGTCGGCAC AAATGACTCG CTGCATATCA CCGACACGGG CTCGATTACT GGCGCGCCGA CCACAGCATT AACCGTGGAA CAGAATGCCA CATTAGCCAC GCTGCTTAAT GATGGCACCA TCAGTGATGA CGGTACTAAC GGTAATAATT ATGACGTTAA CATCGTTCAG ATTAATGGCG CGGTAACGAC ATTCGAAAAT ACCGGGACTA TCTCGAGCAT TAATCAGTAT CACTACGGCA GCATAGTCGC AGTAGGCGCA TCAGGTGAAA TTGATAATTT CACGAATAGC GGCACGATTA AAAACGCGCC TGATAATTTT AATCCAGGCA TGAATAATAC CGGTGCTGTC GCGAATATTG GTTATATTAA AACGCTGACT AACACTGCTG ATGGTAAGAT CACCGGATAT ACTGGCATCA ACAACCAGGG CAGAATAGAT ACGTTATTAA ATGCAGGCGT GATTACCTCC GACACGGGTA ATTATGGGAT GATGATGGGA GATAATGCCG CCATTTATAA TAATATGAAT AGCAGCATTG GCACATTGCA TAATACCGGA ACCATCCAGT CCATTTCCCG TTATAGTTAT GATGGTGGCG GAATCTTTAA CTTTGGTACG ATTGATACCC TTATCAATGA TGATAAAATC ATTGGGGGTT CATTCGGTAT TCAAAACTAT GGTGTGATTG GTACTCTTGA GAACAACGGT AAGATTACCG CCACTAACTT TGGTATTTAT GCAAGCACCT CTAATACAAC CTCTATTGGC ACGATAGCTA ATAATGGTGA AATCAGCGGG GCGAACTACG GAATTTTAAT TTCGAGTTAT GACCAAAGTC TGGAAACAAA TATTATCAAC AATGGGTTAC TGAGCGGTAA GGAAGATGCG CTTTATCTCA GCGACAACAA CTCATCTTCC CTTGGTAATG TTACGTTAAC CAACAGCGGT GTGGTTGCGG GTAATATTAA TGCCAATAAC ACTTCACCGT TAAAAATTAA TGGTGGAACC ACCACCATGG GCACATTGAC CGGCCTGAAC GGCATTGGGA CCATTACCAG TACCCGCTCT AACGTGGAAT TTGGTACCGG CTCACTGCTG CTTAACGACA ATGTCGTCGC CAGCACGGTT GTCAACAATG CTGCGTCATT GCAGGTGAAT AACAGTATTA CTGTGACGGG CGATTACCAT CAGAAAGCCG CTGCCACGCT TACTTCGGGT ATCTCTGATG TTGCGATTTC CAGGACCGAT CTGATGGCCG AAACCGGCTA TGGTCGCCTG AATGTCAGCG GCAACGCCAC GTTTGATCAA GGTTCCAGCG TCAATCTGAT ACGCACAGGA AATACCTATA AATTCGCGGA AGGTCAGCGC TATGTGGTGG TGAATGCCAC GGGTGCAGAG ACGAATTACA ACGCGGATAA ACTGAAGTAT AAAGCCATCG GCTACCGTGG TGCGGTACAA GGTTCTGTCT TCGATGATGG CGAGAATAAA GCGCTGGTCT TAACCGTCGG TGCTGAGCAG ACAGTGACTC CGCCAGTTGT GACATCTCCA GTTGTCACTG CACCGGGAGT CACACCGCCT GTCGTGACAG CTCCGGTCAT CACACCACCG GTTGCGACAC CTCCGACGCA GCCAACTCAG CCGGATCGCG GTTTGGCCAC GATTCCAAGC 
GCTACCGCAT CACTCGGCGG TCTGGGGAAC TACACCGGGA TTGCATCTCC GCAATTACTG GAGCTTTTCA ACGCCTCGCT GGCGATTGAC AGCAAAAGTG AAGCGAACCG CGTAGGTGAG AGTTTGTCAC CGGGTCAAAA CATCAACACC AGTTCAGCCG CTGCGGTGGC AACGTCGACC GCTCAGGCCG TGGTGGGTGC GCACATTGAT GCAGTCCGTA ATCCAGCCAA CTCCGGTACC AGCGGCGTGG CGACGGGTGA TGACTACGCC AGCAACTGGA TTGTCTGGGG TCAACCGTTC GGCGGATATG CGCGTCAGGA CAGCACGGTT GAAGTGAGTG GTTACAGCGC GAAGTTTGGC GGCCTGATCA TGGGTGCAGA TCGTTCGCTG GGCGACGACT GGCGTTTGGG TGCGGCGGTG AACTACAGCA ATACGTCTGT TCACGGTAAA GGAAACCTGA ACGGAAATAC GTCGACGGCT GATAACTACG GCGTCATCGG TTACGCCGGC TTCACGGGCG ATCCGTGGTA TCTGAACTTG TCGGCGGGTG TGAACCGTCA GAACTATACC TCCGTTCGTC GCGCGGATTT CACCGGTTTC TCCGGCGCTG CGCAGGGTAA ATTCAACGGT CAATCGGTCA CGCTGCAGAC TGAATTCGGC TATCCGCTGA CGCTGCAAGC TGGCGTGGTT CTGACACCGC TTGCCAGCCT GACTTACGGC TATCAGCACG TTGATGGATA TAAAGAGACG GGTGGAAACG GTATGGCGCT GGATGTGGGC AGCAGCCATG CGCAGTCCGT CGTGAGCGAT ATCGGTGCGC GCATCGAGAA AACCTTCGCG ACCGGTCTTG GCAATCTGAC GCCATTCGCC CAGGTGACGT GGTTGCATCA GTACGATGAT CGCCAGGTCA GCAGCCGCGC GTCCTACGCC GCTGATACGG TTGGTGAAAC CAGCTTTACG ACCAAAGGCG CATCGCCAGT GGAAGACATG GCTGGCGTCG CGATCGGCAG CACGCTGTAT GAAGCGAACG AGCTGAACCT CGACGCTCGC TACGATCTGC AAGCGGGAGA GCGCTATCAG GCGCATACCT TCAGCCTGCG TCTGCGCAAA ATGTTCTAA
Protein sequence | MKKKYLSQLI SLLVASTAAQ GLLTTHALAV SGTVIDSAFT SINVVGTNDS LHITDTGSIT GAPTTALTVE QNATLATLLN DGTISDDGTN GNNYDVNIVQ INGAVTTFEN TGTISSINQY HYGSIVAVGA SGEIDNFTNS GTIKNAPDNF NPGMNNTGAV ANIGYIKTLT NTADGKITGY TGINNQGRID TLLNAGVITS DTGNYGMMMG DNAAIYNNMN SSIGTLHNTG TIQSISRYSY DGGGIFNFGT IDTLINDDKI IGGSFGIQNY GVIGTLENNG KITATNFGIY ASTSNTTSIG TIANNGEISG ANYGILISSY DQSLETNIIN NGLLSGKEDA LYLSDNNSSS LGNVTLTNSG VVAGNINANN TSPLKINGGT TTMGTLTGLN GIGTITSTRS NVEFGTGSLL LNDNVVASTV VNNAASLQVN NSITVTGDYH QKAAATLTSG ISDVAISRTD LMAETGYGRL NVSGNATFDQ GSSVNLIRTG NTYKFAEGQR YVVVNATGAE TNYNADKLKY KAIGYRGAVQ GSVFDDGENK ALVLTVGAEQ TVTPPVVTSP VVTAPGVTPP VVTAPVITPP VATPPTQPTQ PDRGLATIPS ATASLGGLGN YTGIASPQLL ELFNASLAID SKSEANRVGE SLSPGQNINT SSAAAVATST AQAVVGAHID AVRNPANSGT SGVATGDDYA SNWIVWGQPF GGYARQDSTV EVSGYSAKFG GLIMGADRSL GDDWRLGAAV NYSNTSVHGK GNLNGNTSTA DNYGVIGYAG FTGDPWYLNL SAGVNRQNYT SVRRADFTGF SGAAQGKFNG QSVTLQTEFG YPLTLQAGVV LTPLASLTYG YQHVDGYKET GGNGMALDVG SSHAQSVVSD IGARIEKTFA TGLGNLTPFA QVTWLHQYDD RQVSSRASYA ADTVGETSFT TKGASPVEDM AGVAIGSTLY EANELNLDAR YDLQAGERYQ AHTFSLRLRK MF
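
As a rough illustration of how the two sequence fields relate, the following sketch translates a DNA string codon by codon and computes GC content. It is a minimal example, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in permitted start codons), and it ignores the spaces used to group the sequence into blocks of ten. The names `translate` and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten) and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAAAAAA AATATCTCAG CCAGCTGATC"))  # MKKKYLSQLI
print(translate("CTGCGCAAAATGTTCTAA"))                # LRKMF
```

The two outputs match the first ten and last five residues of the protein sequence. Applying `gc_content` to the full gene sequence should give a value close to the reported 51%, although that figure has not been recomputed here.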