Gene Ent638_3385 details


Gene Information

Locus tag: Ent638_3385
Symbol:
ID: 5112370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 3689287
End bp: 3692205
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 51%
IMG OID: 640493590
Product: outer membrane autotransporter
Protein accession: YP_001178096
Protein GI: 146313022
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
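
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. The record does not state either assumption, but its numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3689287
end_bp = 3692205
reported_gene_length = 2919      # bp
reported_protein_length = 972    # aa


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length):
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # subtract the stop codon


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

print(gene_length, protein_length)   # prints: 2919 972
```

Both computed values agree with the reported Gene Length and Protein Length.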


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.308218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA AATATCTCAG CCAGCTGATC TCCTTGCTGG TAGCTTCGAC GGCAGCACAG 
GGGCTGCTGA CCACCCATGC TTTAGCAGTT TCTGGTACAG TTATCGACTC AGCATTCACC
AGCATTAATG TTGTCGGCAC AAATGACTCG CTGCATATCA CCGACACGGG CTCGATTACT
GGCGCGCCGA CCACAGCATT AACCGTGGAA CAGAATGCCA CATTAGCCAC GCTGCTTAAT
GATGGCACCA TCAGTGATGA CGGTACTAAC GGTAATAATT ATGACGTTAA CATCGTTCAG
ATTAATGGCG CGGTAACGAC ATTCGAAAAT ACCGGGACTA TCTCGAGCAT TAATCAGTAT
CACTACGGCA GCATAGTCGC AGTAGGCGCA TCAGGTGAAA TTGATAATTT CACGAATAGC
GGCACGATTA AAAACGCGCC TGATAATTTT AATCCAGGCA TGAATAATAC CGGTGCTGTC
GCGAATATTG GTTATATTAA AACGCTGACT AACACTGCTG ATGGTAAGAT CACCGGATAT
ACTGGCATCA ACAACCAGGG CAGAATAGAT ACGTTATTAA ATGCAGGCGT GATTACCTCC
GACACGGGTA ATTATGGGAT GATGATGGGA GATAATGCCG CCATTTATAA TAATATGAAT
AGCAGCATTG GCACATTGCA TAATACCGGA ACCATCCAGT CCATTTCCCG TTATAGTTAT
GATGGTGGCG GAATCTTTAA CTTTGGTACG ATTGATACCC TTATCAATGA TGATAAAATC
ATTGGGGGTT CATTCGGTAT TCAAAACTAT GGTGTGATTG GTACTCTTGA GAACAACGGT
AAGATTACCG CCACTAACTT TGGTATTTAT GCAAGCACCT CTAATACAAC CTCTATTGGC
ACGATAGCTA ATAATGGTGA AATCAGCGGG GCGAACTACG GAATTTTAAT TTCGAGTTAT
GACCAAAGTC TGGAAACAAA TATTATCAAC AATGGGTTAC TGAGCGGTAA GGAAGATGCG
CTTTATCTCA GCGACAACAA CTCATCTTCC CTTGGTAATG TTACGTTAAC CAACAGCGGT
GTGGTTGCGG GTAATATTAA TGCCAATAAC ACTTCACCGT TAAAAATTAA TGGTGGAACC
ACCACCATGG GCACATTGAC CGGCCTGAAC GGCATTGGGA CCATTACCAG TACCCGCTCT
AACGTGGAAT TTGGTACCGG CTCACTGCTG CTTAACGACA ATGTCGTCGC CAGCACGGTT
GTCAACAATG CTGCGTCATT GCAGGTGAAT AACAGTATTA CTGTGACGGG CGATTACCAT
CAGAAAGCCG CTGCCACGCT TACTTCGGGT ATCTCTGATG TTGCGATTTC CAGGACCGAT
CTGATGGCCG AAACCGGCTA TGGTCGCCTG AATGTCAGCG GCAACGCCAC GTTTGATCAA
GGTTCCAGCG TCAATCTGAT ACGCACAGGA AATACCTATA AATTCGCGGA AGGTCAGCGC
TATGTGGTGG TGAATGCCAC GGGTGCAGAG ACGAATTACA ACGCGGATAA ACTGAAGTAT
AAAGCCATCG GCTACCGTGG TGCGGTACAA GGTTCTGTCT TCGATGATGG CGAGAATAAA
GCGCTGGTCT TAACCGTCGG TGCTGAGCAG ACAGTGACTC CGCCAGTTGT GACATCTCCA
GTTGTCACTG CACCGGGAGT CACACCGCCT GTCGTGACAG CTCCGGTCAT CACACCACCG
GTTGCGACAC CTCCGACGCA GCCAACTCAG CCGGATCGCG GTTTGGCCAC GATTCCAAGC
GCTACCGCAT CACTCGGCGG TCTGGGGAAC TACACCGGGA TTGCATCTCC GCAATTACTG
GAGCTTTTCA ACGCCTCGCT GGCGATTGAC AGCAAAAGTG AAGCGAACCG CGTAGGTGAG
AGTTTGTCAC CGGGTCAAAA CATCAACACC AGTTCAGCCG CTGCGGTGGC AACGTCGACC
GCTCAGGCCG TGGTGGGTGC GCACATTGAT GCAGTCCGTA ATCCAGCCAA CTCCGGTACC
AGCGGCGTGG CGACGGGTGA TGACTACGCC AGCAACTGGA TTGTCTGGGG TCAACCGTTC
GGCGGATATG CGCGTCAGGA CAGCACGGTT GAAGTGAGTG GTTACAGCGC GAAGTTTGGC
GGCCTGATCA TGGGTGCAGA TCGTTCGCTG GGCGACGACT GGCGTTTGGG TGCGGCGGTG
AACTACAGCA ATACGTCTGT TCACGGTAAA GGAAACCTGA ACGGAAATAC GTCGACGGCT
GATAACTACG GCGTCATCGG TTACGCCGGC TTCACGGGCG ATCCGTGGTA TCTGAACTTG
TCGGCGGGTG TGAACCGTCA GAACTATACC TCCGTTCGTC GCGCGGATTT CACCGGTTTC
TCCGGCGCTG CGCAGGGTAA ATTCAACGGT CAATCGGTCA CGCTGCAGAC TGAATTCGGC
TATCCGCTGA CGCTGCAAGC TGGCGTGGTT CTGACACCGC TTGCCAGCCT GACTTACGGC
TATCAGCACG TTGATGGATA TAAAGAGACG GGTGGAAACG GTATGGCGCT GGATGTGGGC
AGCAGCCATG CGCAGTCCGT CGTGAGCGAT ATCGGTGCGC GCATCGAGAA AACCTTCGCG
ACCGGTCTTG GCAATCTGAC GCCATTCGCC CAGGTGACGT GGTTGCATCA GTACGATGAT
CGCCAGGTCA GCAGCCGCGC GTCCTACGCC GCTGATACGG TTGGTGAAAC CAGCTTTACG
ACCAAAGGCG CATCGCCAGT GGAAGACATG GCTGGCGTCG CGATCGGCAG CACGCTGTAT
GAAGCGAACG AGCTGAACCT CGACGCTCGC TACGATCTGC AAGCGGGAGA GCGCTATCAG
GCGCATACCT TCAGCCTGCG TCTGCGCAAA ATGTTCTAA
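
The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names are hypothetical, and only the first displayed line is used as sample input; to reproduce the GC content field, the full gene sequence would be pasted in instead.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display blocks; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGAAAAAAA AATATCTCAG CCAGCTGATC TCCTTGCTGG TAGCTTCGAC GGCAGCACAG"

seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```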
 
Protein sequence
MKKKYLSQLI SLLVASTAAQ GLLTTHALAV SGTVIDSAFT SINVVGTNDS LHITDTGSIT 
GAPTTALTVE QNATLATLLN DGTISDDGTN GNNYDVNIVQ INGAVTTFEN TGTISSINQY
HYGSIVAVGA SGEIDNFTNS GTIKNAPDNF NPGMNNTGAV ANIGYIKTLT NTADGKITGY
TGINNQGRID TLLNAGVITS DTGNYGMMMG DNAAIYNNMN SSIGTLHNTG TIQSISRYSY
DGGGIFNFGT IDTLINDDKI IGGSFGIQNY GVIGTLENNG KITATNFGIY ASTSNTTSIG
TIANNGEISG ANYGILISSY DQSLETNIIN NGLLSGKEDA LYLSDNNSSS LGNVTLTNSG
VVAGNINANN TSPLKINGGT TTMGTLTGLN GIGTITSTRS NVEFGTGSLL LNDNVVASTV
VNNAASLQVN NSITVTGDYH QKAAATLTSG ISDVAISRTD LMAETGYGRL NVSGNATFDQ
GSSVNLIRTG NTYKFAEGQR YVVVNATGAE TNYNADKLKY KAIGYRGAVQ GSVFDDGENK
ALVLTVGAEQ TVTPPVVTSP VVTAPGVTPP VVTAPVITPP VATPPTQPTQ PDRGLATIPS
ATASLGGLGN YTGIASPQLL ELFNASLAID SKSEANRVGE SLSPGQNINT SSAAAVATST
AQAVVGAHID AVRNPANSGT SGVATGDDYA SNWIVWGQPF GGYARQDSTV EVSGYSAKFG
GLIMGADRSL GDDWRLGAAV NYSNTSVHGK GNLNGNTSTA DNYGVIGYAG FTGDPWYLNL
SAGVNRQNYT SVRRADFTGF SGAAQGKFNG QSVTLQTEFG YPLTLQAGVV LTPLASLTYG
YQHVDGYKET GGNGMALDVG SSHAQSVVSD IGARIEKTFA TGLGNLTPFA QVTWLHQYDD
RQVSSRASYA ADTVGETSFT TKGASPVEDM AGVAIGSTLY EANELNLDAR YDLQAGERYQ
AHTFSLRLRK MF
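
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand, assuming the codon-to-amino-acid assignments of translation table 11, which match those of the standard code; table 11 differs in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The function and variable names are illustrative, and only the first and last displayed lines are used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed lines of the gene sequence (each full line holds 20 codons).
dna_start = "ATGAAAAAAA AATATCTCAG CCAGCTGATC TCCTTGCTGG TAGCTTCGAC GGCAGCACAG"
dna_end = "GCGCATACCT TCAGCCTGCG TCTGCGCAAA ATGTTCTAA"

print(translate(dna_start))   # prints: MKKKYLSQLISLLVASTAAQ
print(translate(dna_end))     # prints: AHTFSLRLRKMF
```

The two outputs match the first 20 residues and the final 12 residues of the protein sequence listed above.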