Gene Haur_4822 details

Gene Information

Locus tag: Haur_4822
Symbol:
ID: 5736667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6148920
End bp: 6149900
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 53%
IMG OID: 641281987
Product: nucleotidyl transferase
Protein accession: YP_001547580
Protein GI: 159901333
COG category: [J] Translation, ribosomal structure and biogenesis
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID:
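
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions rather than statements from the record, although they agree with the values listed. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(6148920, 6149900)
print(length)                              # 981
print(expected_protein_length_aa(length))  # 326
```

Under these assumptions the computed values match the Gene Length (981 bp) and Protein Length (326 aa) fields.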


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGCAG TTATTTTAGT TGGCGGATTA GGCACACGCC TCCGCCCATT GACCAATCAA 
CTCCCCAAGC CACTTGTGCC AATTGCCGGC GAAGCCTTGA TGAGCCGAAC CTTGCGGCGT
TTGTATAAGC AAGGGGTGCG TCATGTGATT TTGGCGGTGC AATATTTGGC CGAACAATTT
TTGGCGGCCT ATGGCGATGG CGCGGCTTTT GGGCTAGATT TGCAGATTGT TCAAGAGCCA
GAAGCCCTAG GCACAGCTGG CGCAGTACGC TACGCCCTTG ATCAAACCAA TTTGCTTAAG
GCTGGGCCGA TTTTAGTGCT GAATGGCGAT GAACTGACTG ATTTCGATGT GGCCCAACTC
TGGCAAGCTC ATGGCCAATT TGGCGGTGTG GCGACGATTG CCGTGCGCCA AGTGGCCGAT
ACCTCAGCCT TTGGGGTAGT TGCTAGCGAT GCGAATCAAC GAGTGTATGC CTTTCAAGAA
AAACCTGCGG CTGGCACGGC CTTGGCCAAC ACCATCAATA GCGGAGCCTA TGTATTTGAG
CCAGCGGCAC TTGCCCAGAT TCCAGCCCAA GGTTTTGCTA TGCTCGAACG CGATCTCTTC
CCCAGCTTGC TAGCGACTCA AGCCCTGATT TACGCCTATC AACACAACGC CTACAGCCAA
GATATTGGCA CATTGGCAGG CTATTTAGCC GCGAATGAAG CGGTATTGTT GGGCCATTTG
CCGCATGAAA CCGTGCATGG CATACAATAT GCAGCAGGAG TGTGGGCTGC GGCTGATGCT
CAAATCAGCC CTAGCGCCCA ATTAATTGCC CCGATTATGC TTGGCAGTGG CTGTGTGGTG
GGCGAGCATG CCCGACTTGA ACGGGTGATC GCATGGGATC GTGTTACAAT TGAAGCCGCT
GCAAACCTAA ACAATGTCGC CATTGCCAAT GATGTGCAGG TTGCCCACCA TGCAACTGTC
GAAGGTCTCG CGCTTGGTTA A
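
The record reports a GC content of 53% for this gene. A minimal sketch of how that figure could be recomputed from the gene sequence above is shown here. The function name `gc_content` is hypothetical, and the code assumes the sequence is pasted as text with the spaces and line breaks that separate the 10-base blocks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first 10-base block of the gene sequence.
print(gc_content("ATGCGTGCAG"))  # 60.0
```

Passing the full gene sequence and rounding the result gives a value that can be compared with the GC content field.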
 
Protein sequence
MRAVILVGGL GTRLRPLTNQ LPKPLVPIAG EALMSRTLRR LYKQGVRHVI LAVQYLAEQF 
LAAYGDGAAF GLDLQIVQEP EALGTAGAVR YALDQTNLLK AGPILVLNGD ELTDFDVAQL
WQAHGQFGGV ATIAVRQVAD TSAFGVVASD ANQRVYAFQE KPAAGTALAN TINSGAYVFE
PAALAQIPAQ GFAMLERDLF PSLLATQALI YAYQHNAYSQ DIGTLAGYLA ANEAVLLGHL
PHETVHGIQY AAGVWAAADA QISPSAQLIA PIMLGSGCVV GEHARLERVI AWDRVTIEAA
ANLNNVAIAN DVQVAHHATV EGLALG
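
The protein sequence can be checked against the gene sequence by translating the DNA with translation table 11, as named in the Gene Information section. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). For sense and stop codons, table 11 assigns the same amino acids as the standard code and differs only in the set of permitted start codons, which this sketch does not model. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest; base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCGTGCAG TTATTTTAGT TGGCGGATTA"))  # MRAVILVGGL
```

Translating the full gene sequence and comparing the result with the protein sequence, after removing the block spacing, is a quick consistency check on the record.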