Gene Ent638_2219 details


Gene Information

Locus tag: Ent638_2219
Symbol:
ID: 5112911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2410224
End bp: 2413409
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 53%
IMG OID: 640492405
Product: fibronectin, type III domain-containing protein
Protein accession: YP_001176944
Protein GI: 146311870
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
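
The coordinate and length fields above agree with one another if the start and end positions are read as inclusive, 1-based coordinates and the CDS ends in a single stop codon. Both points are assumptions here, not something the record states. A minimal sketch of that arithmetic, with illustrative function names that do not belong to any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS ending in exactly one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2410224, 2413409)
print(length_bp)                  # 3186
print(protein_length(length_bp))  # 1061
```

Both results match the Gene Length and Protein Length fields of this record.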


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAATG CTACCGCGAT TAAAGGCCGC AAAGGCGGCA GTTCAAGTTC ACGAACCCCG 
ACAGAACAGC CTGATGATCT TCAGTCTGTA GCGAAAGCAA AAATTCTTAT AGCGCTTGGA
GAGGGTGAGT TTGCCGGGCA ACTTACAGGC AGGGACATTT ACCTGGACGG GACACCTCTT
GAGAGTGCTA ATGGCGCACA AAATTTCAGT GGTGTTGCCT GGGAGTTTCG ACCGGGTAAT
CAGGCGCAGA ATTACATTCA GGGCATTCCC GGAACGGAAA ATGAAATTAG TGTCGGCACG
GAGGTTTCCA GCACCACAGC GTGGACAAGA ACTTTCACAA ATACCCAACT CTCTGCTGTG
CGGGTTCGTT TAAAATGGCC TTCGCTATTT AAGCAGGAAA ATGATGGGGA CCTGGTCGGG
TATTCCATTA ACTATGCCAT CGATCTTCAG ACAGATGGCG GCGCGTGGCA GGCTGTAATT
AATACCCGTG TTACCGGGAA AACGACCTCA GGCTATGAAC GTAGCCACCG TATTGATTTG
CCCAGTGCTG GCAGCACCTG GTCTCTACGT CTTCGTAAAG TAACAGCGGA TGCCAATAGC
GCCAAAATTG GCGACACGAT GACGATCCAG AGCTTCACTG AGGTCATCGA CGCCAAATTG
CGCTACCCGA ACACGGCATT GTTGTATATC GAATTTGACT CCAGCCAGTT CAACGGCTCT
ATCCCTCAAA TATCCTGCGA GCCTCGCGGT CGGGTCATTC GCGTACCTGA CACATACGAC
CCTGTAAATC GCACCTATAG CGGCACGTGG ACAGGGGCAT TTAAATGGGC GTGGACTGAT
AACCCAGCGT GGGTATTTTA TGATCTCGTC GTTACCGAGC GTTTTGGTTT GGGAAACCGG
CTAACTGCCG CGAACATCGA TAAATGGGGA CTGTATCAGA TTGCTCAGTA TTGTGATCAA
CGCGTCCCTG ACGGAAAAGG CGGAAGTGGC ACAGAGCCGC GCTATATCTG TAATGTGTAT
GTGCAGAACA GGAATGAAGC CTATACGGTT CTTCGTGATT TTGCTGCTAT TTTTCGGGGC
ATGACTTACT GGGGCGGGGA TCAGATTGTT AGCCTGGCAG ACATGCCGCG CGATATTGAT
TACAGCTACA CACGCGCCAA CGTCATTGAT GGCCAGTTTG CATATTCCAG CAGTACTACA
AAGACACGCT ATACCACTGC GCTCGTGTCT TACTCTGATC CAGATAATGC CTATGCTGAT
GCAATGGAGC CTGTGTTTGA ACAGGCGCTG GTATCACGAT ATGGCTTCAA TCAGCTCGAA
TTAACGGCTA TTGGCTGCAC GCGACAGTCA GAGGCAAACC GTAAAGGGCG CTGGGGGATA
CTGACCAACA ATAAAGATCG CATCGTTTCT TTCTCTGTTG GGCTTGATGG CAATATTCCG
CAGCCGGGTT ACATCATTGC TGTGGCGGAT GAAATGCTAT CCGGGAAGGT AACCGGAGGG
CGTATCAGCG CGGTGAATGG CAGGGTTATC AGGTTAGATC GTGTTGCTGA TGTGGAAGCC
GGTAACCGGT TAATAGTCAA TCTGCCATCG GGCGCTTCAC AGAGCCGTAC GGTTCAGGCT
GTAAATGGTG AAACCGTAAC GGTCACCACC GCATACAGTG AAACCCCGCA GCCAGAAAGC
GTGTGGGTCG TGGAATCAAA TGAGTTATAT GCGCAGCAGT ATCGCGTCGT GAGTGTTTCA
GATAACAATG ATGGCACCTT CACGATTACG GGCGCGTATC ATGACCCCGA TAAGTATGCC
CGCATTGATA CCGGCGCAAT CATCGATCAA CGCCCCGTCA GCGTTATCCC GCCTGGCAAC
CAGGCACCAC CAGACAACAT CATCATCAGT TCTTTCTCCG TCGTGCAGCA GGGCATCAGT
GTCGAAACCA TGCGCGCCAG CTGGGATCAG GCACCCAATG CCATTTCTTA TGAGGCGCAG
TGGCGTCGCA ACGACGGCAA CTGGGTCAAC GTACCGCGCA GCTCCACCAC CTCGTTTGAC
GTACCGGGAA TTTACGCCGG ACGCTACCTG GTGCGCGTGC GCGCCATCAA CGCCGCTGAA
ATTTCTTCAG GCTGGGGTTA CTCGCAGGAG AAAACACTGA CGGGCAAAGT GGGGAATCCA
CCGAAGCCGG TGGGCTTTAT GGCAACGGGC ATTAACTGGG GGATTCGTCT GAACTGGGGA
TTCCCGGCCA ATACGGCGGA TACGTTAAAA ACGGAAATCC AGTACACGGC CAACAGCGAC
TTTTCCGATC CGTTGCTGCT GTCTGATGTG CCGTACCCGT CTGCCGAATA TACACAGCTC
GGGTTGAGAG CGGGGCAGGA ATTCTGGTAC CGCGCGCAGC TGGTGGACAA GACCGGGAAT
GAATCAGGAT ACACCGACTG GATCAGAGGG ATGTCCAACG ATAACGCCGA TGACTACCTG
GGCGATATTG CGGATGAATT CCTGAGCGCT GCTGATGGTG AGCGACTGAC GAGCGACATC
GACACCAACC TTGAAGCAGC GTTGCAGAAT GCGCTGGCTA ACCATGGAAC TGTCGAGCAT
CAGTGGGCGC AATACGGTGA GGTTCGCGCC GATATCCTGA TTGTTAAAAC GACCGTTGCT
GAAGTTGACA AGGCGATGGC CGAACTGTCA ACGACGGTTC AGGCCCAGAT AGGGGAAGTC
ACTGCCTCGC TGGAGGATAA GCTGACGGCC ACGGTGGATG CCAACGGTGC TACAGCAATC
CATACGCTGA AAGCAGGTGT CCGCATTAAC GGAGTGATGT ACAACGCCGG GATGTCGATT
GCTGTGCTGG CGGAGGCGGG TAAACCGGTT GTCACCCGTG TCGGTTTTAA TGCCAACCAG
TTTGTACTGA TGAGCGGCAG CGGCGACACG CAGTATTCTC CGTTCGCTGT TTACAATGGC
CAGGTCTTTA TCAGTGATGC CTTTATCCAG GACGGCACGA TCACCAACGC CAAAATTGGC
TCTTTTATTC AGTCGAACAA TTACGTTCCT GACCAGGCGG GGTGGCGGCT CGATAAGGCG
GGAACGTGGG TTAACTTCGG GAGCGACGCT GCGGGGGCGA GAAAGACAAC GAACGTCACA
GACAGCATCA GGGACAGTAA CGGTGTTCTT CGCGTGCAAA TCGGAAAGCT GACAGGGGTG
TTTTAA
 
Protein sequence
MANATAIKGR KGGSSSSRTP TEQPDDLQSV AKAKILIALG EGEFAGQLTG RDIYLDGTPL 
ESANGAQNFS GVAWEFRPGN QAQNYIQGIP GTENEISVGT EVSSTTAWTR TFTNTQLSAV
RVRLKWPSLF KQENDGDLVG YSINYAIDLQ TDGGAWQAVI NTRVTGKTTS GYERSHRIDL
PSAGSTWSLR LRKVTADANS AKIGDTMTIQ SFTEVIDAKL RYPNTALLYI EFDSSQFNGS
IPQISCEPRG RVIRVPDTYD PVNRTYSGTW TGAFKWAWTD NPAWVFYDLV VTERFGLGNR
LTAANIDKWG LYQIAQYCDQ RVPDGKGGSG TEPRYICNVY VQNRNEAYTV LRDFAAIFRG
MTYWGGDQIV SLADMPRDID YSYTRANVID GQFAYSSSTT KTRYTTALVS YSDPDNAYAD
AMEPVFEQAL VSRYGFNQLE LTAIGCTRQS EANRKGRWGI LTNNKDRIVS FSVGLDGNIP
QPGYIIAVAD EMLSGKVTGG RISAVNGRVI RLDRVADVEA GNRLIVNLPS GASQSRTVQA
VNGETVTVTT AYSETPQPES VWVVESNELY AQQYRVVSVS DNNDGTFTIT GAYHDPDKYA
RIDTGAIIDQ RPVSVIPPGN QAPPDNIIIS SFSVVQQGIS VETMRASWDQ APNAISYEAQ
WRRNDGNWVN VPRSSTTSFD VPGIYAGRYL VRVRAINAAE ISSGWGYSQE KTLTGKVGNP
PKPVGFMATG INWGIRLNWG FPANTADTLK TEIQYTANSD FSDPLLLSDV PYPSAEYTQL
GLRAGQEFWY RAQLVDKTGN ESGYTDWIRG MSNDNADDYL GDIADEFLSA ADGERLTSDI
DTNLEAALQN ALANHGTVEH QWAQYGEVRA DILIVKTTVA EVDKAMAELS TTVQAQIGEV
TASLEDKLTA TVDANGATAI HTLKAGVRIN GVMYNAGMSI AVLAEAGKPV VTRVGFNANQ
FVLMSGSGDT QYSPFAVYNG QVFISDAFIQ DGTITNAKIG SFIQSNNYVP DQAGWRLDKA
GTWVNFGSDA AGARKTTNVT DSIRDSNGVL RVQIGKLTGV F
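
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any downstream use. A minimal sketch of that cleanup, together with a simple GC-fraction helper; the function names are illustrative, and the GC calculation is a plain base count that may differ from how the database derives its reported figure:

```python
def normalize_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGGCTAATG CTACCGCGAT TAAAGGCCGC AAAGGCGGCA GTTCAAGTTC ACGAACCCCG"
dna = normalize_sequence(sample)
print(len(dna))  # 60
print(f"GC fraction of the sample: {gc_fraction(dna):.2f}")
```

Applied to the full gene sequence, the same helpers would be expected to give a length of 3186 and a GC fraction near the reported 53%, though that has not been checked here.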