Gene Haur_3099 details

Gene Information

Locus tag: Haur_3099
Symbol:
ID: 5734971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3908066
End bp: 3909163
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 50%
IMG OID: 641280243
Product: rRNA (guanine-N(2)-)-methyltransferase
Protein accession: YP_001545865
Protein GI: 159899618
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2813] 16S RNA G1207 methylase RsmC
TIGRFAM ID:
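
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3908066
end_bp = 3909163

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1098
print(protein_length)  # 365
```

Both results match the Gene Length (1098 bp) and Protein Length (365 aa) fields listed in the record.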


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATGA ATGCATGGTT GTTGCCAGTA CCCTTTGATA CCGAACGTGA CGATTGGGGC 
GCAACTTTAT TGGCTGAATG GGGAGCCAGC GTGGTTACGC CAGGCCAGCA AGCCTTGGTG
ATCGGCGCAG GCACTGGCCG GATTGGCTTG GCATTGGCGC GAGCAGGCGC ACATGTCAGT
TTTGCTGATG ATTCAATTGT GGCCTTGGCC GCTGCCCGCC AGAGCTTTGC CCAAGCCAAG
TTGCCAGCCC AATTTTTTAG CACCACCGAC TTAACCCCAA GCAAACCGTT TGACCTGGTG
TTGATTAACA TTTTGTGGTG GAGCGATAAC CAGCGTGGCG CAGAACTGAT TAATTTGGCA
GCGCAACACA CCAATGCTGG CGGGATTGTG GCGATTGGCG GTGGTAAACA AGCTGGATTA
AGTGGAGCAA CCACGTTGCT CGAACAGATT GTCGGGCCAA GCGTCAAAAC GCTCTATAAA
AAAGGTCATC ATGTGGTAAT GGCATTTCGG CCTTTACACT GGCAAGCTCA GCCAAGCCAA
ACAACCCAGC ATCAGCTCAG TCATGGCGAC CATGCATTAA CTATCGAGGC AACTGCTGGC
GTATTTGCTC AAGGCCAACT TGATCCAGCC AGTGCTATGT TATTGGATGC AGTGCACATT
CAAGCCAACC AGCGCGTGCT TGATCTTGGC TGTGGCGCTG GAATTTTGGG CATGTTTTTG
CAACAGCGCG AATCCACTCT TGCTTTAACC TACATCGATA GTACCATGGT AGCAATTGAA
GCCACTAAAC GAAACTTACA AACCAACCAA TTAACTGGCC GAGTGCTGGC ATCTGATGGA
ATTCAAGCCG TCAATGGCGA GCAATTTGAT CTGGTCGTGT CGAATCCACC ATTTCATGTT
GGTCGAGTTC AAAGCCCACA ACTGGCCGAA AATCTCTTAA AGCAGGCTGT CCAAGTGTTG
GCTCCCAATG GCCAATTGGT GATTGTTGCC AATCGCTTTT TGCGCTATGA ACCATTGCTA
GAAACTGTTT TGAGTAATGT GCATGAGCTA GCAGGCGATC AACGTTATAA AGTTTTGGTT
GGTACAAAAG CAAATTAG
 
Protein sequence
MRMNAWLLPV PFDTERDDWG ATLLAEWGAS VVTPGQQALV IGAGTGRIGL ALARAGAHVS 
FADDSIVALA AARQSFAQAK LPAQFFSTTD LTPSKPFDLV LINILWWSDN QRGAELINLA
AQHTNAGGIV AIGGGKQAGL SGATTLLEQI VGPSVKTLYK KGHHVVMAFR PLHWQAQPSQ
TTQHQLSHGD HALTIEATAG VFAQGQLDPA SAMLLDAVHI QANQRVLDLG CGAGILGMFL
QQRESTLALT YIDSTMVAIE ATKRNLQTNQ LTGRVLASDG IQAVNGEQFD LVVSNPPFHV
GRVQSPQLAE NLLKQAVQVL APNGQLVIVA NRFLRYEPLL ETVLSNVHEL AGDQRYKVLV
GTKAN
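
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The following is a minimal sketch, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares with the standard genetic code (table 11 differs only in the permitted start codons). The function name `translate` and the helper constants are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGTATGA ATGCATGGTT GTTGCCAGTA"))  # MRMNAWLLPV

# Last 18 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("GGTACAAAAG CAAATTAG"))  # GTKAN
```

The two outputs match the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to the same function would allow the whole protein to be compared.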