Gene Ent638_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4239 
Symbol 
ID5110444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009425 
Strand
Start bp53795 
End bp56797 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content59% 
IMG OID640480856 
Productvirulence protein SrfB 
Protein accessionYP_001165518 
Protein GI146284565 
COG category[S] Function unknown 
COG ID[COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.104797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCTG AACTTACTGA TTTTAAAAAG CAGGTCAAAA TTATTCGTGA CAGTGGCATC 
CAGTTCCTGG ATTTCGCGTT TACGCTGCCT GCCCGTAAAG AGTACGGTGA TTTTTTGCGC
CAGAACGGCG ATGGCGCCAT CCTGCGCCTG GTCTATAACG AACGCGAAGA TAAACTGCAA
ATCCCGGCGG CAGATAATGC CGCGCCACAA TTTGCTGAGT CGGATTACTC TCTGACCACG
GAAGAGTCAC TGCACCTCTA TCAGGGGCTG TGGCTCCCGC TGCCCTTCTT CCGCTTTAAC
CCGCCGCGTG CGTTTGCACA CGGGCCAACC AACTGGGCGC GCGTGCAGTT CCACGAGCTG
GCAGAACCCG ACGAGAAGGG CAACACCTGG CGCGCGATCC TGATCTTCGA TACCAAAATT
TTCCCGGATC GCGATAACAC GCAATATCTG GCTCCCAGCG AAGATGACGT GCGCTCTGGC
GCCGGTTTTG CGCTGGCCCT GCACCCGCAC GAGATGGGCG ATTTCCTGAC CCTCCCGTGG
GTAGACGAGT GGTTGCGTGA AGTGTTCAGC ACCCAGGCGC GCGACGTGCT GCGCCAGCAT
GCGGAAGATA TTGATGAAAA ACTGGGGCAA AAAGAGCACC AGGCGCACTA CCTCAACCTG
CTGAATATCC TTGATGCCAC CGTGGCCATC CCGGAAGTGC AGGTCAATGA CGTGAAGATC
CGCGACTCGG CCATTCCGGT TGATTTGGTG CTGGATATCG GCAACTCCCG CAGTTGCGGA
ATCCTGATTG AAGAGCACCG CGACGACAAC AAAGGTCTGT CCCAGCTTTA CCAGCTGCAA
CTTCGCGACT TAAGCCAGCC GCAAAACGTC TACAACGAAC CCTTCGACAG CCGCCTGGAG
TTTGCGCAGG CCGAGTTTGG CAAGCAGGAC TTTTCGCTGA AAAGCGGACG CAGCGACGCC
TTCACCTGGC CGACCATTGG CCGCGTGGGC GATGAGGCCT TCCGCATGGC AGCACAGCGC
CTGGGAACAG AAGGCTCAAC CGGGATCTCC AGCCCGAAAC GCTATCTGTG GGACGACCAG
CCTTACTCCC CCGGCTGGCG TTTCAGCCAG GCGTTCGTCA AATCCGATCG CGAGCCGCTC
GCCACCGCCG CCCCGCTGCT GTACATGCTT AACGACCAGG GCAAGCTGCT GATTCGTCTG
CCTGAAGATG AGAGGATGCC GGTCTTCTCC CCGGTTTACA GCCGCAGCTC GTTAATGACC
ATGATGCTCT CTGAAGTCCT GAGCCAGGCG CTGATGCAGA TCAACAGCCC GGCGCAGCGT
CTGAAGATGA ACCACGCCAG CACGCCGCGC CGCTTGCGCA ATGTGATCAT GACCGTCCCG
CCCGCGATGC CAAAACCTGA ACGCGCCATC TTTGAGCAAT GCATGCTGGA TGCCATCCGT
CTGGTGTGGA AAGCGCTGGG CTGGGAGGAG ATGGACGACG AAAGTGAAGA CAACGAACAG
CTGAAACATC CGCGACCTGG CGTTCACGTG AAGTGGGACG AAGCCACCTG CGGACAGCTG
GTCTATCTGT ACAACGAAAC CCAGACCTAC TTTGGTGGGC GTACCGATGA GTTCTTTGCG
GCGACCCGTC GCCCGGACAA CGCCCCTGCC GCAAACGCGC CGCGCAGCCT GAAGGTGGCC
TCCATTGATA TTGGCGGCGG CACCACTGAC CTGGTGATCT CGCGCTATAC GCTGGATGAC
GGCGAAGGCA TCAACGTCCG TATCACGCCA AAACAGCTGT TCCGCGAAGG CTTCAAAGTG
GCGGGAGACG ACATCCTGCT GGATGTGATC CGCCTGTACC TGCAACCGGC TGTGAAAGCC
GCGATCGTGA AGGTTGGCCA TAGCGATATG ACCGCAGAAT CGATGATGTC GCAGCTGTTT
GGCAGCGAAT CCATCGAGGC GGGCAAGCAG GTTCTGCGTC AGCAGCTGAC CCTGCAGATC
TTCGCCCCAC TGGCGCTGGC GATCCTGCAT CGCTATGAAG AGTATTCGCC GGAGAGCGGC
CGCGAGCAGC TCAGCTTCAC GTTCCGCGAT CTGCTGACCG AAAACATGCC GACGCAAAAA
GTGCAGGACT ACGTGAATGA CGTGGTGCGC ATGGGTCAAC TGTCCAGCGA AGCGCTCTTC
TCTATTCTCG ATGTGCCGCT GGAGATCGAT CTGGCCAACC TGCACAACGA ATTCATTAAC
CCACGCAGTG GCCGCATGAA CATCTGCCAC AGCCTGCGCG CCCTGTGCGA AGTGCTCTGG
CACTATAACT GCGACGTGCT GCTGCTGACG GGTCGTCCGT CGCGTCTCCC GGGGATTCAG
GCGCTGATCC GCCAGCTTCA GCCGGTGCCG CCATCGCGCG TGCTGCCTCT GCATGGCTAT
GAGACCGGTG GCTGGTATCC GTTCAACAAG AAAGGCTGTA TCGACGACCC GAAATCTACC
GCTTCGGTTG GGGCGATGCT GTACCTGTTG GCAGAGAACT CGCGGTTGAG CAGCTTCTTC
TTCCGCACCC AGAACTTTGT GCCTTATTCC ACCATCCGCT ATCTCGGCAT GCTGGACGGC
AACAACCTGA TCAAAGACAG CAACGTCTTT TATCGCGATA TCGATCTCGA CGCGCCGGCG
TTCCAGCTTC CGCAGGGGCA AAGCTTTGAC GCGCGTGGCG AAGTGCGCAT TGGTTTCCGC
CAGCTTGATA ACGAGCGCTG GCCTGCCTCT GCACTTTACA CGCTGAAAAT CGCCAACCCT
AACCTCGCGA GCGAGCTGGC CGGGGATGCC ATGATGCGCA TCGAACTCAC CGCCGAGCAG
GGACGCCCAC GCAACGGCAT CGAGGCGGTC AGTCCCGAGA AGTTCCGCAT TGAATCCCTC
GAGACGGATC ATGCCCGTCG CAACTACAAC CGCAAAGACG TGGCCTTCCA GTTGAACACT
ATGGTCGGTA ACGGTCTCAG CGAGACGCAC TACTGGCTGG ATAGCGGGAG TATCAAAAGC
TAA
 
Protein sequence
MLAELTDFKK QVKIIRDSGI QFLDFAFTLP ARKEYGDFLR QNGDGAILRL VYNEREDKLQ 
IPAADNAAPQ FAESDYSLTT EESLHLYQGL WLPLPFFRFN PPRAFAHGPT NWARVQFHEL
AEPDEKGNTW RAILIFDTKI FPDRDNTQYL APSEDDVRSG AGFALALHPH EMGDFLTLPW
VDEWLREVFS TQARDVLRQH AEDIDEKLGQ KEHQAHYLNL LNILDATVAI PEVQVNDVKI
RDSAIPVDLV LDIGNSRSCG ILIEEHRDDN KGLSQLYQLQ LRDLSQPQNV YNEPFDSRLE
FAQAEFGKQD FSLKSGRSDA FTWPTIGRVG DEAFRMAAQR LGTEGSTGIS SPKRYLWDDQ
PYSPGWRFSQ AFVKSDREPL ATAAPLLYML NDQGKLLIRL PEDERMPVFS PVYSRSSLMT
MMLSEVLSQA LMQINSPAQR LKMNHASTPR RLRNVIMTVP PAMPKPERAI FEQCMLDAIR
LVWKALGWEE MDDESEDNEQ LKHPRPGVHV KWDEATCGQL VYLYNETQTY FGGRTDEFFA
ATRRPDNAPA ANAPRSLKVA SIDIGGGTTD LVISRYTLDD GEGINVRITP KQLFREGFKV
AGDDILLDVI RLYLQPAVKA AIVKVGHSDM TAESMMSQLF GSESIEAGKQ VLRQQLTLQI
FAPLALAILH RYEEYSPESG REQLSFTFRD LLTENMPTQK VQDYVNDVVR MGQLSSEALF
SILDVPLEID LANLHNEFIN PRSGRMNICH SLRALCEVLW HYNCDVLLLT GRPSRLPGIQ
ALIRQLQPVP PSRVLPLHGY ETGGWYPFNK KGCIDDPKST ASVGAMLYLL AENSRLSSFF
FRTQNFVPYS TIRYLGMLDG NNLIKDSNVF YRDIDLDAPA FQLPQGQSFD ARGEVRIGFR
QLDNERWPAS ALYTLKIANP NLASELAGDA MMRIELTAEQ GRPRNGIEAV SPEKFRIESL
ETDHARRNYN RKDVAFQLNT MVGNGLSETH YWLDSGSIKS