Gene Haur_1281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1281 
Symbol 
ID5733174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1491032 
End bp1492228 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content46% 
IMG OID641278421 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001544057 
Protein GI159897810 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAC TGATTACCGC GCTCGATCAG TTAGCTGAAA CGCCCCAAGC CATCAGTCGC 
ATTCGCGAGC TTATCTTGCA ATTGGCTGTG CAAGGCCGCT TGGTCGAGCA AGATCCTAAC
GATGAGCCAG CGTATAACAT GTTTAAGCCG CTAATCAAAG AGCAACAAAT GTTGAATATA
GACGTTAGAT CATCTATAAA TAAAGAACAT ACAAAATTTC AGATTCCTCC TTCATGGATA
TGGGTATCAC TGGATGATAT TGTAGTCTAT GATGCAGGTT CAAAGCATGA TCCCAACAAT
CTTGATCCTG ATAGTTGGTT GTTAGAACTT GAGGATATTG AAAAAAATAC CTCTGTTATT
TTAGGACAAT TTCTAGTAAA AGAGCGAAAG CCTAAAAGTA ACAAAGCAAG CTTCCAGAAA
AATGATATTC TTTATGGAAA ATTGCGACCT TATTTGAATA AAGTTATTGT TGCTCATACT
TCAGGATTTT GTACTACTGA AATAGTGGTG TTACGTCCAA AATTGGAATT GAGTCCCTTC
TATATACAAA ATTTCCTCAA AAGCCCCTTT TTCGTTAGCT ACGTAAACCA ACATTCATAT
GGAACAAAGA TGCCTCGACT AGGAACACTA GATGGCAAAA AGGCATCTAT ACCCCTACCA
CCACTCGCTG AACAACAACG CATCGTCGCC AAAGTTGCGC AATTGATGGC GTTGTGCGAT
CAGCTTGAGC AGCAGCAAAC CAGCCGCGAG GCGCTGCGCC AGCAAGTCCA GCAAAGCGCA
ATCAAGCAGC TTTTGAGCGA GCTAGCCCGA CCAGCCGATG CGCAGCAGAT TGCCCAACCA
AGCCAACCTG AGCGCCAAAC CAGCCTCTTT ACTCGGCCCA CTCTGGCGAA TCAACCAATC
GAGCCGCTTG CTGACGACGG ATTGAGCATA AGCAGTGAGC AACAGTTGTT TTTTGAGCAG
TTTGACGATC TGCATACCAC GCCCAAAGCA ATCGGCCAAT TGCGCGAATT GATTTTGCAG
CTCGCCGTGC AAGGCCGCTT GGTGGCTCAA AACCCCAGCG ACGAGCCAGC GAGCATTTTA
TTGGAACGAA TTCAAGCCGA AAAACAACGC CTGATTGCAG CGGGCCAACT CAAGCCCGAA
AAAGCGCTTA CGCCCATCGC CGCCAGCGAG CTACCATTTG GCTTACCCAA GGGCTAG
 
Protein sequence
MNELITALDQ LAETPQAISR IRELILQLAV QGRLVEQDPN DEPAYNMFKP LIKEQQMLNI 
DVRSSINKEH TKFQIPPSWI WVSLDDIVVY DAGSKHDPNN LDPDSWLLEL EDIEKNTSVI
LGQFLVKERK PKSNKASFQK NDILYGKLRP YLNKVIVAHT SGFCTTEIVV LRPKLELSPF
YIQNFLKSPF FVSYVNQHSY GTKMPRLGTL DGKKASIPLP PLAEQQRIVA KVAQLMALCD
QLEQQQTSRE ALRQQVQQSA IKQLLSELAR PADAQQIAQP SQPERQTSLF TRPTLANQPI
EPLADDGLSI SSEQQLFFEQ FDDLHTTPKA IGQLRELILQ LAVQGRLVAQ NPSDEPASIL
LERIQAEKQR LIAAGQLKPE KALTPIAASE LPFGLPKG