Gene Haur_4855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4855 
Symbol 
ID5736701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6186585 
End bp6187613 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content53% 
IMG OID641282021 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001547613 
Protein GI159901366 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCT ATAAGGATGC TGGCGTTGAT ATTGCAACCA AAATGGATGC GATCCAACAG 
ATGGGAGCCG CAGTTAAGGC CACCCATACG CCAGCAGTGT TGGCAGGTTT AGGCGCGTTT
GGCGGCTGTT TTGATTTGGC TCAGGTTAAT GCAAGCCATG CGCAGCCAGT TTTGGTAGCC
TCGACCGATG GGGTTGGCAC CAAAACAGCG GTGGCCGCAG CCGTTGGCGA TGTACGCACG
ATTGGCGCAG ATCTCGTTAA CCATTGTATT AATGATATTT TGTGCCAAGG CGCAACGCCA
CTCTTTTTTC TCGATTATAT TGCGGCCTCA AAGCTTGAGC CAGCCATGGT TGTGGCCGCA
GTTGAGGGGC TAGCAGCAGC CTGTCGTGAT GCAGGAATTG CCTTGCTTGG CGGCGAAACT
GCCGAAATGC CAGGCGTTTA CCACGATGGA GCCTTTGATG TAGCAGGAAC AATCGTCGGT
GTGGTTGATC GGGCGCATAT GCTTGATGGC AGCGCAATTA AGCCAGGTGA TGTGGCAATT
GCCTTGCCTT CGACGGGCTT GCACACCAAT GGCTATTCGT TGGCGCGAAA AGTTTGCGCA
CCCTTAGGCT ATGCCAGCCA ACCAACAATT TTAGCTGGTT TGAGCATTGG CGAGGCTTTG
CTGGCTCCGC ATCGGGCCTA TTTGCACGAA GTTCAGGCCT TGCGTCAAGC AGATGTGGCG
ATTCACGGTT TGGCCCATAT CACTGGCGGC GGCATTTGGG ATAACATTCC GCGGGTATTG
CCAGCCAACG TGACAGTCGA ATTAGTTCGT GGTTCATGGC AAGTTCCAGC AATTTTTAAA
TTAATTGTGG AACAAGCGGC GATGGATGAA CACGAGGCCC ATCATGTGCT CAATATGGGC
TTGGGCATGA TTCTATTTAT TGCGGCTGAA CAGGCTGAGC AAGCACTCGC AACCATCAGC
GATGCCCAAC TGGTCGGACG GGTGATCGAG CAAATTAATC AACCGCGAGT GGTGCTAGTA
GATCACTAG
 
Protein sequence
MTTYKDAGVD IATKMDAIQQ MGAAVKATHT PAVLAGLGAF GGCFDLAQVN ASHAQPVLVA 
STDGVGTKTA VAAAVGDVRT IGADLVNHCI NDILCQGATP LFFLDYIAAS KLEPAMVVAA
VEGLAAACRD AGIALLGGET AEMPGVYHDG AFDVAGTIVG VVDRAHMLDG SAIKPGDVAI
ALPSTGLHTN GYSLARKVCA PLGYASQPTI LAGLSIGEAL LAPHRAYLHE VQALRQADVA
IHGLAHITGG GIWDNIPRVL PANVTVELVR GSWQVPAIFK LIVEQAAMDE HEAHHVLNMG
LGMILFIAAE QAEQALATIS DAQLVGRVIE QINQPRVVLV DH