Gene Tgr7_2374 details

Gene Information

Locus tag: Tgr7_2374
Symbol:
ID: 7318163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 2506922
End bp: 2508001
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 65%
IMG OID: 643617272
Product: Protein involved in cellulose biosynthesis (CelD)-like protein
Protein accession: YP_002514439
Protein GI: 220935540
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
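
The coordinate and length fields above are consistent with one another, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2506922
end_bp = 2508001

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1080
print(protein_length)  # 359
```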


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACGGA TCGAAGTCAT CACAAACCTG GAGGGACTGC ATGCCCTGGC ACCCGACTGG 
AAGTCGCTCG GCGCCCGGTT CCCGACCCCG CTGCTGCAGT ACGAATGGTT CCTCAGCTGC
GCAGAGGCAC TCTACCCCGG TGGTGAGCTT CGCGTGGTCA CCCTGGAGGA AGATGGACGG
CTGACCGCCC TCGCCCCCCT CGCCCAGGTG CGACGCAGAC ACGGGCGATG GCTGGAAATC
ATGGGTGTTT CAAAACTGCA CGAGCCCGCC GGGCTGCTCT ATGAGAATCC GGCAGCACTG
GAGAGACTCA TGCAGGCGCT GACGCGTCTG GGCCATCCCC TGGACCTGGG ACGACTGCCC
GGAAGCTCCC CACTGCTCAC GGGCCCCGGC ATCCACATCG ATGCGAGGGG ACTATGGCTG
CGCAGACCCG GGGCGGACCG TTGTGTCCTG GACCTTCAGG AGGGATGGGA CGGGGTCTGG
TCTGCCATGC GCAAGAGCCG GCGCACGGAC TTCCGCCGTC TCGAACGTCG GGCACGGGAC
CAGGGCGGTT TTCAACTGCA GGTGGAGAAA CCGGTACCGG CCCAGGTGGA CGACCTGCTG
GAACAGGCAA TGGACATTGA GGCGGCAGGC TGGAAGGCCT CGCGGGGTAA CACCCTGAGG
CAAGACCCGA TCCTGGCCGA TTTTTTTCAC CGCTATTGCC GTCGCATGGC TGAACACGGC
GCGCTCCATC TCTACTACCT GCGCCTTGGC GGGCATACCG TTGCCATGCA TGTCACCGTA
TCATTCGCCA GCGTCCTCTG GGTACTCAAG ATCGGCTACG ACGAGCGCTG GCGCAACCTT
TCGCCCGGCC TCTACCTGGC GCTGGAAACC ATCCGGCACG CCGCCCAGAA TGGCCTGTCT
GCCTATGAAT TCCTGGGCTC TGCGGAGGAC TGGCAAGGGG CATGGCCTAC CCGCCGGGAG
TCCCACGGGA CGCGGATCTG TCTGCCCTTG AGTGCCGAGG GCATGCGCGG CCTTCTGGAT
CTGGCAAGCA CCAGACTCGG TCGCCATACA AGCCGGCCAG ACCGGGCCGC TCCAGCTTGA
 
Protein sequence
MTRIEVITNL EGLHALAPDW KSLGARFPTP LLQYEWFLSC AEALYPGGEL RVVTLEEDGR 
LTALAPLAQV RRRHGRWLEI MGVSKLHEPA GLLYENPAAL ERLMQALTRL GHPLDLGRLP
GSSPLLTGPG IHIDARGLWL RRPGADRCVL DLQEGWDGVW SAMRKSRRTD FRRLERRARD
QGGFQLQVEK PVPAQVDDLL EQAMDIEAAG WKASRGNTLR QDPILADFFH RYCRRMAEHG
ALHLYYLRLG GHTVAMHVTV SFASVLWVLK IGYDERWRNL SPGLYLALET IRHAAQNGLS
AYEFLGSAED WQGAWPTRRE SHGTRICLPL SAEGMRGLLD LASTRLGRHT SRPDRAAPA
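
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here, so the result will not necessarily match the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above (illustration only).
first_line = "ATGACACGGA TCGAAGTCAT CACAAACCTG GAGGGACTGC ATGCCCTGGC ACCCGACTGG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` in the same way gives a single string whose length can be compared with the Gene Length field.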