Gene Ent638_0865 details

Gene Information

Locus tag: Ent638_0865
Symbol:
ID: 5111054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 961151
End bp: 964279
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 57%
IMG OID: 640491041
Product: exonuclease subunit SbcC
Protein accession: YP_001175600
Protein GI: 146310526
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00618] exonuclease SbcC
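
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(961151, 964279)
print(gene_len, "bp,", protein_len, "aa")  # 3129 bp, 1042 aa
```

Under these assumptions the result matches the reported values of 3129 bp and 1042 aa.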


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.379439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTC TCAGCCTGCG TTTGAAAAAC ATCAACTCCC TGAAGGGCGA ATGGAAAATT 
GATTTCACCG CCGAGCCGTT CGCCAGTAAC GGTCTTTTCG CAATTACCGG CGCGACCGGC
GCAGGCAAAA CCACCCTGCT CGACGCCATC TGCCTCGCGC TGTATCACCA GACGCCGCGT
CTGAATAAAG TCACACAATC ACAAAATGAT CTCATGACGC GCGACACCGC TGAATGTCTG
GCGGAAGTGG AATTTGAGGT TAAGGGCACC GCTTATCGCG CTTTCTGGAG CCAAAGCCGC
GCGCGTAATC AGCCTGACGG GAATTTACAG GCACCGCGCG TAGAGCTGGC GCGCTGCGAA
GACGGTAAAA TTCTGTCTGA CAAAGCCACA GACAAAATTG AGCAAACAGC CACGCTGACC
GGTCTGGATT ACGATCGTTT CACGCGCTCT ATGCTGTTGT CGCAGGGGCA ATTTGCCGCC
TTCCTCAATG CCAAACCTGG CGAACGCGCC GAGCTGCTGG AAGAACTTAC CGGAACCGAG
ATTTACGGTC AGATTTCCGC AAGGGTCTTT GAACAACACA AAGCCGCACG CACGGAGCTC
GAAAAGCGCG AAGCGCAAGC GGCTGGTGTG ATTTTACTGA GCGAAGAGCA GCAACAGCAG
CTCACCCAGA GTTTGCAGGC ACTTACTGAC GAAGAGAAAA TGCTGCTGGT ACAGCAACAG
AATCACCAGC GCGATTTTCA CTGGTTGACC CGTCATGAGG AACTCCAGCG TGAACAGCAG
CGCGCGTTAA CCTCGCAACA GGAGGCGCAA CAGGCGCTCA CCGACGCCGC ACCAGACCTG
GATAAATTGC TTCACGCCCA GCCCGCCGCA GTGCTGCGCC CACTCTGGGA ACGTCAGCAG
GAACAGACGA CGCGTCTCAC ACAAACCCAA CAGCAGACTC AGGAAGTGAA TACTCGCTTA
CTGTCTCGGG CAGCACTGCG CGCGCGCATC CGCAACGGTG CGCTGCGCAC CCGCGACCGG
TTGCAAACCG AGCTTTCGGC ACTGACACAG TGGCTAGCCG GGCATGATCG TTTCCGTCTG
TGGGGGCAGG AAATCGCAGG CTGGCGAGCG CATTTTGCTC AGCTTAATCG CGACAAAAGC
CAGATAGCGA CGCTGACCGG ACGCATCACC GAGCTGCGGC AAAAACTGGC GAACGCGCCA
GAGATCGCCC TGACGCTGAC CGCTGATGAA GTCGCTGCGG CGATGGACCA GCAAACGGCG
TCGCGTGCAT CTCGCCAGCA GTTGACCACC CTTCATGCCC GTTATCAACC GCTGCAAAAA
CGCATCGTTC AAAACGCGGA AAGCGTGCAA AAGGCGCAGG CGGAGCAGAT AAAACTGAAC
GAGACGTTGG CCCTGCGTCG ACAGCAGTAT AAAGAGAAAC ACCAGCATTT TCTGGATGTT
AAAGCCTTAT GCGAGCGAGA AGCCGAGATT AAAGATCTCG AGGGTTATCG AGCCCGTCTT
GAAGCGGGAA AACCCTGCTT GCTCTGCGGT TCAACTGAGC ATCCTGCGGT GGAGCAATAC
CAGGCGTTGG AAATAACAGA AAATCAGCGC CGCCGTGATG CGATGGAAAA AGAGGTCGCA
GCGCTAAAAG AGGAGGGGCT GCTGGTGTTG GGGCAAGTCA ATGCCCTGGC CAGTCAGATT
CAGCGCGACA CCGACGAGGC GCAGACGCTC GCCCAGGAAG CGCAAACACT CACGAGCGCG
TGGCGCGAAC TGTGTGCAAC CCTGAACGTT ACTTTGAATA TTGATGATAT TGCTCCGTGG
CTGAACGAAC AGGAGCAGTA CGAGCGCCAG CTGTATCAAC TCAGTCAGCG CCTGGTCTTA
CAAAGCCAGC TTAATGAACA GGAACACCAG GAGCGGCAAC TCCAGCAGCA GATTGCTACA
ACACGCCTGA CGCTGGAAAA CGCGCTGAAT GCGCTGTCGT TGAAGGTCCC CGATGAGGGT
GCTGAAACGC TCTGGCTCAC CGAGCGCGAA AGCGAATCGT CGCACTGGCA GGCGCAGCAG
AATCAGCTCA CCGCCCTTCA GGAACGTATC AACGCCCTAA CGCCGCTGCT GGATACGCTG
CCCGCGACGG ACACAGAAGA CGCGGAATCC GTTATTCCGG ATAACTGGCG CGAGATCCAC
AATGAATGCG TCTCGCTGCA AAGCCAACTT GCGACATTGC AGCAGCAGGA AAGCCTTGAA
ACCGAGCGCA TGCAGCAAGC GCAGGCGCAA TTTGCCGCTG CGCTGGCGGG AAGCGCATTC
CCCGACTGCG ATGCGTTCCT GCGCGCTTTG CTTGATGCCG ACACGATGCA GCGTCTGGAA
ACGTTAAAGC AAACGCTGGA AAACCGGATT CAGCAAGCGA CAGCGCTGGT GCGTCAGGCA
AATCAATTGC TGAGCGAGCA TCTGGCGCAG CGTCCGGAAG GATTGCAGTC GGATGTGCCG
ACACTGCAGC TTGAACTCCA GCAACTGGCG CAACGTCTGC GTGAAAATAG CACCCGTCAG
GGTGAGATCA GCCAGCAGCT CAGGCAGGAT GGCGAGAACC GGCAGCAGCA GCAGGCGCTT
ATCCAGCAGA TTGACGAGGC GGCTCGCCTT GCGGATGACT GGGGATATCT GAACGCGCTG
ATCGGCTCCA GTACGGGCGA TCGTTTCCGT AAATTCGCTC AGGGGTTGAC GCTCGACAAT
CTGGTATGGC TGGCGAATCA GCAGTTGAAT CGCCTGCATG GCCGCTATCT TCTGCAACGC
AAAGCGAGCG ACGCGCTGGA GCTGGAAGTG GTAGATACGT GGCAGGCCGA CGCGGTACGC
GACACGCGAA CCCTCTCCGG AGGCGAGAGT TTCCTGGTGA GCCTGGCGCT GGCGCTGGCG
CTTTCTGACC TGGTGAGCCA CAAAACGCGT ATTGATTCCC TGTTCCTCGA CGAGGGATTC
GGTACGCTGG ATAGCGAAAC GCTGGATACC GCACTCGACG CGCTCGATGC GCTGAACGCG
ACGGGAAAAA CTATCGGCGT GATCAGTCAT GTGGAAGCAA TGAAAGAGCG CATTCCTGTG
CAGATCAAAG TGAAGAAAAT TAACGGGCTG GGATACAGCA AGCTGGATAA GGTGTTTGCG
GTGGAGTGA
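
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line. The sketch below shows one way to join those blocks into a single string and compute a GC fraction from it. It is an illustration only: the helper names `join_blocks` and `gc_fraction` are hypothetical, and the method used by the source database to derive its GC content field is not stated in this record.

```python
def join_blocks(text):
    """Strip the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed above.
first_line = "ATGAAAATTC TCAGCCTGCG TTTGAAAAAC ATCAACTCCC TGAAGGGCGA ATGGAAAATT"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence to `join_blocks` and then to `gc_fraction` gives a value that can be compared with the GC content field.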
 
Protein sequence
MKILSLRLKN INSLKGEWKI DFTAEPFASN GLFAITGATG AGKTTLLDAI CLALYHQTPR 
LNKVTQSQND LMTRDTAECL AEVEFEVKGT AYRAFWSQSR ARNQPDGNLQ APRVELARCE
DGKILSDKAT DKIEQTATLT GLDYDRFTRS MLLSQGQFAA FLNAKPGERA ELLEELTGTE
IYGQISARVF EQHKAARTEL EKREAQAAGV ILLSEEQQQQ LTQSLQALTD EEKMLLVQQQ
NHQRDFHWLT RHEELQREQQ RALTSQQEAQ QALTDAAPDL DKLLHAQPAA VLRPLWERQQ
EQTTRLTQTQ QQTQEVNTRL LSRAALRARI RNGALRTRDR LQTELSALTQ WLAGHDRFRL
WGQEIAGWRA HFAQLNRDKS QIATLTGRIT ELRQKLANAP EIALTLTADE VAAAMDQQTA
SRASRQQLTT LHARYQPLQK RIVQNAESVQ KAQAEQIKLN ETLALRRQQY KEKHQHFLDV
KALCEREAEI KDLEGYRARL EAGKPCLLCG STEHPAVEQY QALEITENQR RRDAMEKEVA
ALKEEGLLVL GQVNALASQI QRDTDEAQTL AQEAQTLTSA WRELCATLNV TLNIDDIAPW
LNEQEQYERQ LYQLSQRLVL QSQLNEQEHQ ERQLQQQIAT TRLTLENALN ALSLKVPDEG
AETLWLTERE SESSHWQAQQ NQLTALQERI NALTPLLDTL PATDTEDAES VIPDNWREIH
NECVSLQSQL ATLQQQESLE TERMQQAQAQ FAAALAGSAF PDCDAFLRAL LDADTMQRLE
TLKQTLENRI QQATALVRQA NQLLSEHLAQ RPEGLQSDVP TLQLELQQLA QRLRENSTRQ
GEISQQLRQD GENRQQQQAL IQQIDEAARL ADDWGYLNAL IGSSTGDRFR KFAQGLTLDN
LVWLANQQLN RLHGRYLLQR KASDALELEV VDTWQADAVR DTRTLSGGES FLVSLALALA
LSDLVSHKTR IDSLFLDEGF GTLDSETLDT ALDALDALNA TGKTIGVISH VEAMKERIPV
QIKVKKINGL GYSKLDKVFA VE
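
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts) and that the gene begins with ATG, as it does here. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a CDS, ignoring grouping whitespace and one final stop codon.

    Alternative start codons are not converted to Met in this sketch.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the gene sequence listed above.
print(translate_cds("ATGAAAATTC TCAGCCTGCG TTTGAAAAAC ATCAACTCCC TGAAGGGCGA ATGGAAAATT"))
# MKILSLRLKNINSLKGEWKI
print(translate_cds("GTGGAGTGA"))
# VE
```

Both outputs agree with the start (MKILSLRLKN INSLKGEWKI) and the end (VE) of the protein sequence listed above.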