Gene Haur_2372 details


Gene Information

Locus tag: Haur_2372
Symbol:
ID: 5734253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3021359
End bp: 3023887
Gene Length: 2529 bp
Protein Length: 842 aa
Translation table: 11
GC content: 51%
IMG OID: 641279513
Product: cellulose-binding family II protein
Protein accession: YP_001545140
Protein GI: 159898893
COG category:
COG ID:
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
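
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3021359, 3023887)
protein = expected_protein_length_aa(length)
print(length, protein)
```

With the Start bp and End bp values from this record, the result agrees with the listed Gene Length (2529 bp) and Protein Length (842 aa).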


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTACGAC GAATGTATGC TCGTGGAACA TTGTTGGTAT TGTTGATCGC CTTGATCAGC 
CTTAGCGTTG CACATCGCAC GGCTACGCCC AGCGCGGCAG CGGCCAGTTG CGTGGTCAGC
TATAAGGTGA TCAATCAATG GGCCGATGGG TTTATTGGCG ATGTGACCGT TACCAATAAT
TTGGCAGCAA CCACAACTTG GCAGTTAAGC TGGACGTTTG CTGGCAATCA GCGGATTGTT
AATCTGTGGA ATGGCGTTTT GACCCAAACT AACGCCGCCG TCAGTGTGCA AAATGCCGCT
TGGAATGGCT CGATCACCAG CGGTGGCAGC GTTAATGTTG GCTTTCAAGC AACATTCAGT
GGCACGAATA GTATTCCCAC GAGTTTTACC CTGAATGGGG TGGTTTGTGG CGAAACTAAC
GTAACGATCA CACCGCCAAC CAGTAACCCA ACCAACACGG CGACGGCGCG GCCCCCAACT
AACACCCCAA TGCCCCCAAC TGGCACTCCG CGTCCATCGA ATACCCCAAC GATTGCTCCA
ACTACCCCAA CCGCAACGCC CAACAACAAT GGCTGTATTG GCGCAATTAT CTGCGACGAT
TTTGAAAACC AAACCGGCAT TGATCCCAGT GGATTTTGGC AAGCGCTCTA TCCTGATTGT
ACTGGCCATG GCTCAGTGAT GGTCGATAAC ACCTATGCCA ATAGCGGCAG CAAATCGATC
ATGGTTCATG GGCATAGTAA TTTCTGTGAT CATGCCTTTT TCGGCAATAC CACGGCGATC
GAAAGCATTG GCAATCTGTT GTATGGCCGC TTTTATGTGC GCCAAGATTT AGCTTTGCCC
GACCATCACA TCACATTAAT GGCGCTCAAA GATCGCAACG ACGCTGATAA TGATTTGCGC
TTGGGCGGCC AGCAAGGGGT GCTCGATTGG AATCGCGAAT CGGATGATGC AACCGCCCCA
TCGATGAGCA GCGCAGGCAT CGCCAAATCG GTCACGATTC CACCTCGTCA GTGGACATGC
GTCGAATTCA AGATCGATAG TGCCAATGGC TATTTGCAGG CCTGGGTCAA CGGCAGCGAA
GTTGAAGGCA TGCGGGTTGA TGGGGTCGAT ACGCCTGATA TTGACCAAGC TTGGCGTACT
CGTGCCAATT GGCAACCACG CTTAGTTGAT ATTCGGTTGG GTTGGGAAAG CTACGGCGGC
GATGATATTA CCCATAATAT TTGGTTTGAT GATGTGGCCC TGGCAACTCA ACGTATCGGC
TGTAGCGCTA GCAACCCAAC TACGCCGACC GCTACTCCGC GCCCAACCAA CACCACCACG
CCCGTGGTTG GCACGCTCAC GCCTAGCCCA ATTCCCAGTG CAACGCCAAC CATCGCGCCC
AATGGCATTA CTGCTGCTAC CACGATTGCC GAGCTTTGCC CAACCTACCG CCAACTCTAT
ATCGATCGTG ATTTTCTCGA TTATTTGCCG AGCGACGGCA GTGGCGGCCA TAACAACATG
CCAGTTGATG GCGGCATGAC TGATGCCCAA AAAGCTAGTT TATACGGCGT GAATATCAGC
GCCATTCAAA GCAAAATTGG CAATGGCACA TTAACTTTGG GCGAATTGGG AACCCAAGCC
TTGGGCCATG TGCAACGCTT AGTCAATCAA AATTTCCCCA AAAATGCGAT TTGTCAGTTG
TTGCCACGCT TGATGCTGCT AGGGCCGGAG ACCGAAACCG CTACCTTCCA CAAAAATAGC
AGCAATCCAT GGGCTGAAAC CGCTGGCCCC GTCAATGCGA TTGCCCCCGC TGGATTTATG
CAAACCCGCT GGCCAACTGA TGCTCGTACC TATGTGCCAG CCGAAAAAGC CGAACGCGAC
CGTTGCCACG ATCAGCCAGT GCATGAAAAT AATCTTGGCT GGACATTTAG CTCGATCGTT
GATCCGAGTA TTTTGTATGA TCCAAATAAT CCAGTGTTGA ACGCAATTCG CACGAGTACC
CACCCAATCA GTGGCTTGCC GATGGGGCCA GGCTTTAGCA GTAATGCGCC GATGCAGGCA
ACGACCAAAC TGCACGAGGA AAATTCGGGT TTCTGGTATC AGGTGATTCA GTTCAAAAAT
ACCAGTACTA TTCCCTATTA CCTCGATTGT GCCATGATTT GGTGGGTCGG GCCATCGGGC
TTATCGTTTG ATTTACGTAA TGGTCATTAT AATAATGAAC AACGCCCTGG CCGTGGCTAT
GGTCACCCAC AGCGCGATAT CATCGAAGTG GTGTACAATC AAACACAAAA GCTCTCGGTG
TATAGCATTC GTTTGTCGTT CCACGATGAG CCGTACAACA TGCGCACGGC CTACCCCAAC
CAATATTGGT CGTTGGAAGT TGGCACGCCC GCCTTTTTGA ATGGACAGGC GCGTTATACC
ACCTCAGCCG AACGCCAAGC CTTGATGGAT TTGATGCTCA ACACATTGCA TGTCGAGCTT
GAAACCAACC TTGATCGCAA TATTGAGCTA TTCGATGCCC TGAAGATGCG TAATCGGGTT
TCGAATTAG
 
Protein sequence
MLRRMYARGT LLVLLIALIS LSVAHRTATP SAAAASCVVS YKVINQWADG FIGDVTVTNN 
LAATTTWQLS WTFAGNQRIV NLWNGVLTQT NAAVSVQNAA WNGSITSGGS VNVGFQATFS
GTNSIPTSFT LNGVVCGETN VTITPPTSNP TNTATARPPT NTPMPPTGTP RPSNTPTIAP
TTPTATPNNN GCIGAIICDD FENQTGIDPS GFWQALYPDC TGHGSVMVDN TYANSGSKSI
MVHGHSNFCD HAFFGNTTAI ESIGNLLYGR FYVRQDLALP DHHITLMALK DRNDADNDLR
LGGQQGVLDW NRESDDATAP SMSSAGIAKS VTIPPRQWTC VEFKIDSANG YLQAWVNGSE
VEGMRVDGVD TPDIDQAWRT RANWQPRLVD IRLGWESYGG DDITHNIWFD DVALATQRIG
CSASNPTTPT ATPRPTNTTT PVVGTLTPSP IPSATPTIAP NGITAATTIA ELCPTYRQLY
IDRDFLDYLP SDGSGGHNNM PVDGGMTDAQ KASLYGVNIS AIQSKIGNGT LTLGELGTQA
LGHVQRLVNQ NFPKNAICQL LPRLMLLGPE TETATFHKNS SNPWAETAGP VNAIAPAGFM
QTRWPTDART YVPAEKAERD RCHDQPVHEN NLGWTFSSIV DPSILYDPNN PVLNAIRTST
HPISGLPMGP GFSSNAPMQA TTKLHEENSG FWYQVIQFKN TSTIPYYLDC AMIWWVGPSG
LSFDLRNGHY NNEQRPGRGY GHPQRDIIEV VYNQTQKLSV YSIRLSFHDE PYNMRTAYPN
QYWSLEVGTP AFLNGQARYT TSAERQALMD LMLNTLHVEL ETNLDRNIEL FDALKMRNRV
SN
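
The relationship between the two sequences above can be reproduced with a short script. This is a minimal sketch rather than the database's own method: the helper names are hypothetical, and it assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
first_block = "ATGCTACGAC GAATGTATGC TCGTGGAACA"
print(translate(first_block))  # MLRRMYARGT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to translate and gc_fraction would allow the listed protein sequence and GC content to be checked the same way.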