Gene Tgr7_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_1784 
Symbol 
ID7317594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp1899181 
End bp1902537 
Gene Length3357 bp 
Protein Length1118 aa 
Translation table11 
GC content58% 
IMG OID643616676 
Producthypothetical protein 
Protein accessionYP_002513853 
Protein GI220934954 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.424401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAAG TTCAGAATAA TCTGATTTTC TCAGCTTCAG ACCTGAGCTA TTTCCTCGAA 
TGCCCCCATC GCACTACGCT TGACCGCCTG AGCCTCGACC AGCCTATGGA CAAGGCTGTA
GCCGGTGAGG AGACGAGACT CATCCAAGAG AAGGGTGTCG AGCATGAGCG GGCCTACCTG
GAATCACTGA AATCCAGTGG TCGGAATGTC GTTGAGATAG ATGACGCGCT CGGTTTGGAT
GAGCGGTGTG CAGCCACTCA GGAGGCCATG CGGGGTGGAG CAGATATCAT CTACCAGGCG
GTCTTTATGA ATGACCGCTG GTTGGGCTAT GCGGATTTTC TCAAGCGTGT AGAAGGGGCT
TCGGAGCTGG GCGATTACAG CTACGAACCG GTGGACACCA AACTTTCCAC ACAACCCAAA
ACCAAGCACC TGATCCAGCT ATGTGTCTAT TCTGATCTGC TGCAGGACGC CCAGGGAACA
GTGCCAGAGT CTATGCACCT GGCACTTGGC AATGGCGAGA GCCGCAGTTT CCGGGTTGAG
GACTATCGGC ATTACTACGC CCGATCTCGG GATCAATTCC TCGGCTTTAT TGAGAACCCG
ACAGAAACCC GGCCGGAACC CTGTGATTTC TGCAACTTCT GCCACTGGCA TGAGCGCTGT
GAAAAGCAGT GGGAAGTTGA GGATCATCTG AGCCTCGTTG CCAATATCAC TAAAGGCCAG
ATCCGCAAGT TGCGGGATGC AGGTATCCAT ACGGTGGCCG ACCTCGCAGG CCACGATCAG
TCTGTAGCTA TTTCTGGCAT GAACAAAGAG GTCCTGGAGC GGCTTCGTGA ACAGGCCTCC
CTTCAGGTCC AGGGCAGGAC TTCCGGCAAA CCGGTCTACC GTCTTTTAAA GCAGGATCCA
GATGGCCGAA AAGGTTTCTT CCGGATGCCT GAACCGTCGG AGGGCGATCT CTTCTTTGAC
ATGGAGGGCG ACCCGCTCTA TCCGGAGGGG CTTGAGTACC TCTTCGGTGT CTATTACCTG
GAGAACGGGG AGTGGCGATT CACCGCGTTC TGGGCCCATG ATCACGATGC CGAGAAGAAG
GCCCTGGAAG ATTTCGTCGA TTTCGTTGTC GAGCGACTGA AGCAGTATCC GGATGCCCAC
ATCTACCACT ACAACCACTA TGAGGTGACC GCGATCAAGC GCCTCATGAG TCGCTACGCC
ACCCGTGAAC GGGAGGTGGA TGATCTGCTG CGTCGCGAGC GTTTCGTGGA CCTTTTCAAG
GTGGTTCGGG AGTCGATCCG TGTCTCAGAG CCATCCTACT CCATCAAAAA TCTTGAGCAT
TTCTACATGG ATGCCCGTGA TGCGGACGTC AAAACGGCCG TGGGTAGTAT CGTCTGGTAT
GAACACTGGC GCGAGAGCCG GGACGACGAC CTGCTGGAAC AAATCCGCAA GTACAACGAG
GATGACTGCC GTTCCACGCT CCTGCTCCGG GACTGGCTCC TTGGGCTCCG TCCTTCCAAT
CTACCCTGGT TCAGCGGGGA GGTGCGGGGG GAGTCCGAGG GGGTGGATAG GATCACCGAG
CATGAACAAC GTCTGGCCCG ATATGAAAAG GCCCTTCTGG GCAACAAGGC GGAAGATCCC
GAGACCCTCC ATCACAACAC GTTGATCTAT CAGTTGCTGG ACTTCCATCG CCGAGAGGCG
AAGCCCCAGT GGTGGGCCAT GTTTGCCCGT CAGGATATGG AAACCGCTGA TCTGATCGAG
GATTCAGAGT GCCTCGGCGG CCTTGAACTC GTGTCGACAG AAAAAGCCAC CACCAAGTCA
ATGGATTGCG TCTATCGGTT CCCGCCCCAG GAGACCAAAC TCAAGGCTGG CGATGTGGTT
CACGTGGCTG AATCAAGTGA TCGTCTGGGG ACGATTCGTT CGCTGGATGA CGAAAATGGC
ACCGTTACGA TCAGGACAAC CTTCGCCGAG ATCCCCGAGA GTCTTTCGAT CGGGCCAGGT
GGTCCTGTCG AAACCCGGGT GCTGAGCGAG GCGTTGTTCC GCTACGCCGA TGCGCATCTT
GCGAAAAAGG CCTGTTACCC AGCGCTCGAT GCCTTCCTCA CCCGTAAGCC CCCGCGTCTG
AAGGGCAGGG AGGCCCCTGA GCCGCTGGCG CCCCATGGCC ATCTGGCTCA GATTCTCGAT
GCCGTGGAAC GGCTGGATGG CAGTCACCTG TACATCCAGG GGCCGCCGGG TGCGGGCAAG
ACCTACACCG GCTCCCACCT GATTGTCGCC CTGCTGCGCA AGGGGTTCCG CATCGGCGTG
ACCTCGAACA GTCACAAGGC GATCGACAAC CTGCTCGAGG CCGTGGAAAA GGTGGCCCAG
AAAGAAGGTG TGGCGTTCAG GGGCGTGAAA AAGACCACCC AGAAGAACGA TCTGGGTTTT
GAGGGGCACC AGATCGAGGC CTGTACTTCC AATCCTGACA TCGAGAGCGA TGAGGACTAT
CAACTGGTGG CTGGGACGGC ATGGCTGTTT GCCCGGGAGG GCCTGGATCA GCGCTTTGAT
TACCTCTTCA TTGATGAGGC AGGGCAGGTG TCACTGGCCA ACCTGGTGGC CATGGGGACC
AGTGCGCGGA ACCTGGTATT GCTCGGCGAT CAGATGCAGC TCTCCCAACC GATCCAGGGA
TCGCATCCGG GCCACTCGGG CGACTCGGCC CTGGACTATC TGCTGAACGG GCGTTCGGTG
ATCGAGCCGG AGCGGGGTGT CTTCCTGGAA ACCACCCGGC GCATGCATCC CGATGTGTGT
CAGTTCATCT CGGACGCCAT CTATGCCGGT TGTCTCATGC CCCATGAGGA TAATCACAAG
CAGCGCCTGA TGCTCAGCTC CACGGCTCAC CCGGAGCTGG TCCCTAACGG AATCCGCATG
GTCGAGGTGA TGCATTCGGA TCGCGGTCAG CAGTGCCCGG AAGAGGCGGA TGTAATTGAG
GCCATGTACC AGAGCCTGCT CAAGCAGAGT TACGTAGACA AGCACGGCAA GGAGCACCCG
ATGAGGCAGG AGAACATCCT CGTGATCTCG CCCTACAACA TCCAGGTGAA CCTGCTTAAA
GGCAGGCTTC CGCCTGGGGC CAGGGTCGGC ACCGTGGACA AGTTCCAGGG GCAGGAGGCC
GAGGCTGTGT TGATCTCCAT GGCGACCTCT GACGCCGACA ACATGCCCCG CAACGCGGAT
TTCCTTTTCA GCCGAAACCG GCTGAATGTG GCCATCTCCC GTGCCCGCTG TCTGGCTGTG
GTGGTGGCAA ATCCTGCCTT GCTGAACGTG CCCTGCAAGA GTGTCACGGA GATGGAGTTG
GTCAATACCT TGTGCTGGCT GAAGTCATAC TCCGATCAGC TCCGCGCTCG CCATTGA
 
Protein sequence
MHKVQNNLIF SASDLSYFLE CPHRTTLDRL SLDQPMDKAV AGEETRLIQE KGVEHERAYL 
ESLKSSGRNV VEIDDALGLD ERCAATQEAM RGGADIIYQA VFMNDRWLGY ADFLKRVEGA
SELGDYSYEP VDTKLSTQPK TKHLIQLCVY SDLLQDAQGT VPESMHLALG NGESRSFRVE
DYRHYYARSR DQFLGFIENP TETRPEPCDF CNFCHWHERC EKQWEVEDHL SLVANITKGQ
IRKLRDAGIH TVADLAGHDQ SVAISGMNKE VLERLREQAS LQVQGRTSGK PVYRLLKQDP
DGRKGFFRMP EPSEGDLFFD MEGDPLYPEG LEYLFGVYYL ENGEWRFTAF WAHDHDAEKK
ALEDFVDFVV ERLKQYPDAH IYHYNHYEVT AIKRLMSRYA TREREVDDLL RRERFVDLFK
VVRESIRVSE PSYSIKNLEH FYMDARDADV KTAVGSIVWY EHWRESRDDD LLEQIRKYNE
DDCRSTLLLR DWLLGLRPSN LPWFSGEVRG ESEGVDRITE HEQRLARYEK ALLGNKAEDP
ETLHHNTLIY QLLDFHRREA KPQWWAMFAR QDMETADLIE DSECLGGLEL VSTEKATTKS
MDCVYRFPPQ ETKLKAGDVV HVAESSDRLG TIRSLDDENG TVTIRTTFAE IPESLSIGPG
GPVETRVLSE ALFRYADAHL AKKACYPALD AFLTRKPPRL KGREAPEPLA PHGHLAQILD
AVERLDGSHL YIQGPPGAGK TYTGSHLIVA LLRKGFRIGV TSNSHKAIDN LLEAVEKVAQ
KEGVAFRGVK KTTQKNDLGF EGHQIEACTS NPDIESDEDY QLVAGTAWLF AREGLDQRFD
YLFIDEAGQV SLANLVAMGT SARNLVLLGD QMQLSQPIQG SHPGHSGDSA LDYLLNGRSV
IEPERGVFLE TTRRMHPDVC QFISDAIYAG CLMPHEDNHK QRLMLSSTAH PELVPNGIRM
VEVMHSDRGQ QCPEEADVIE AMYQSLLKQS YVDKHGKEHP MRQENILVIS PYNIQVNLLK
GRLPPGARVG TVDKFQGQEA EAVLISMATS DADNMPRNAD FLFSRNRLNV AISRARCLAV
VVANPALLNV PCKSVTEMEL VNTLCWLKSY SDQLRARH