Gene Ent638_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3820 
Symbol 
ID5110864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4118685 
End bp4121015 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content58% 
IMG OID640494029 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001178526 
Protein GI146313452 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAG ATTCGCTCTG CCGCATTATT GCGGGTGATA TTCAGGCCAG GGCCGAACAG 
GTAGAAGCTG CCGTTCGCCT GCTTGACGAA GGGAACACCG TGCCGTTTAT CGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGTTAC GTAACCTGGA AACCCGTCTG
GGCTACCTGC GCGAGCTGGA AGACCGTCGT CAGGCCATTC TCAAGTCCAT TGGCGAACAG
GGCAAATTGA CCGAGGCGCT GGCGGGTGCC ATCAACGGTA CGATGAGCAA AACCGAGCTT
GAAGACCTCT ATCTGCCGTA TAAACCGAAA CGCCGTACTC GCGGGCAGAT CGCGATTGAA
GCGGGCCTTG AGCCGCTGGC CGATCTGCTG TGGAACACCC CGTCGCACGA TCCTGAAACG
GAAGCCGCGA AATTCATCGA CACTGACAAA GGCGTAGCGG ACACCAAAGC CGCCCTCGAT
GGCGCCCGTT ATATTCTGAT GGAACGCTTC GCCGAAGACG CGGCGCTACT GTCAAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATATC GTCGCGAAAG TTGTCAGCGG CAAAGAAGAG
GAAGGCGCGA AATTCCGCGA CTACTTCGAT CATCACGAAC CGATTTCTAC CGCGCCATCG
CACCGTGCGC TGGCGATGTT CCGTGGCCGT AACGAAGGCA TCCTGCAGCT TTCTCTCAAT
GCCGATCCGC AGTTTGATGA GCCGCCGAAA GAGAGCTATT GCGAGCAGAT CATTACCGAC
CATCTTGGCC TGCGCCTGAA TAACGCGCCT GCTGATAGCT GGCGTAAAGG CGTGGTGAGC
TGGACGTGGC GTATCAAAGT GCTGATGCAC CTTGAAACTG AACTGATGGG CACCGTGCGC
GAACGCGCTG AAGACGAAGC CATCAATGTC TTTGCCCGTA ACCTGCATGA CTTGCTGATG
GCCGCCCCAG CGGGCCTGCG CGCCACGATG GGTCTCGATC CCGGCCTGCG TACCGGCGTG
AAAGTGGCTG TCGTTGACGC CACCGGCAAA CTGGTCGCCA CCGACACGAT TTATCCGCAC
ACCGGTCAGG CCGCAAAAGC CGCCGTTGCC GTGGCGGCAC TTTGCGAAAA ACATAACGTC
GAGCTGGTGG CGATTGGTAA CGGTACTGCA TCGCGTGAAA CCGAGCGTTT CTTCCTGGAC
GTGCAGGAGC AGTTCCCGAA AGTCACCGCG CAGAAAGTCA TCGTGAGCGA AGCGGGTGCG
TCGGTCTATT CCGCGTCCGA ACTGGCGGCG CTCGAATTCC CGGATCTGGA CGTTTCCATT
CGCGGCGCTG TCTCTATTGC CCGCCGTTTG CAGGATCCGC TGGCCGAGTT GGTGAAGATC
GACCCGAAAT CTATCGGTGT GGGTCAGTAC CAGCATGACG TCAGCCAGAC TCAGCTGGCG
CGTAGGCTGG ATGCGGTCGT GGAGGACTGC GTGAACGCCG TTGGCGTGGA TCTGAACACC
GCGTCCGTCG CGCTGCTCAC CCGTGTGGCT GGGCTGACGC GCATGATGGC GCAAAATATC
GTCTCGTGGC GTGATGAGAA CGGACAGTTC CAGAACCGTC AGCAGTTACT CAAAGTCAGC
CGTCTCGGGC CAAAAGCCTT TGAACAGTGC GCGGGCTTCC TGCGTATCAA CCACGGCGAT
AACCCGCTGG ACGCCTCAAC GGTTCACCCG GAAACGTATC CTGTGGTGGA ACGTATTTTG
GCCGTCACGC AGCAAGCGCT GAAAGATCTG ATGGGCAACA GCGCGGAACT GCGCAACCTG
AAGGCCGTCG ATTTCACCGA TGAGCAATTC GGTATCCCAA CCGTCACAGA CATCATCAAA
GAGCTGGAAA AGCCAGGCCG CGACCCGCGT CCTGAGTTTA AAACGGCGAA ATTTGCCGAA
GGTGTTGAGA CGATGAAAGA CCTGCTGCCT GGCATGGTGC TGGAAGGCGC GGTCACGAAC
GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGTCT GGTCCATATT
TCGTCACTGG CAGATAAATT TGTTCAGGAT CCGCATACCG TGGTGAAAGC GGGCGACATC
GTGAAGGTCA AAGTGCTCGA AGTGGATATG CCCCGCAAGC GTATCGCGCT GACGATGCGT
CTGGACGAGC AGCCAGGCGA AACCAACGCA CGTCGTGGTA ACAGCGGCGG TGCACGCGAG
CAGGCACGCC CGGCGGCAAA ACCGGCGCAA CCGCGCGGTC GCGAAGCACA GCCTGCAGGC
AACAGCGCCA TGATGGATGC GCTGGCTGCG GCGATGGGTA AAAAACGTTA A
 
Protein sequence
MMKDSLCRII AGDIQARAEQ VEAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
GYLRELEDRR QAILKSIGEQ GKLTEALAGA INGTMSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WNTPSHDPET EAAKFIDTDK GVADTKAALD GARYILMERF AEDAALLSKV
RDYLWKNAHI VAKVVSGKEE EGAKFRDYFD HHEPISTAPS HRALAMFRGR NEGILQLSLN
ADPQFDEPPK ESYCEQIITD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAVA VAALCEKHNV ELVAIGNGTA SRETERFFLD VQEQFPKVTA QKVIVSEAGA
SVYSASELAA LEFPDLDVSI RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RRLDAVVEDC VNAVGVDLNT ASVALLTRVA GLTRMMAQNI VSWRDENGQF QNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP ETYPVVERIL AVTQQALKDL MGNSAELRNL
KAVDFTDEQF GIPTVTDIIK ELEKPGRDPR PEFKTAKFAE GVETMKDLLP GMVLEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLADKFVQD PHTVVKAGDI VKVKVLEVDM PRKRIALTMR
LDEQPGETNA RRGNSGGARE QARPAAKPAQ PRGREAQPAG NSAMMDALAA AMGKKR