Gene Haur_2185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2185 
Symbol 
ID5734072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2770510 
End bp2771736 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content46% 
IMG OID641279326 
Productglycosyl transferase group 1 
Protein accessionYP_001544953 
Protein GI159898706 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAACC GTGGCTTAAG GCTCGTCATC GTGACAACCT TCCCTCCAAG CACTGGTACG 
CTTAATGAGT ATGCCTGGCA TTTCGTGCGG GCTTTTCAAC AAAAGCGCGA AATTCGTGAG
ATTATTGTCT TAGCCGATGA GCTTGCCAGG CCAGAACCAG AGCTACCAAA TCAAATTAAT
GGTATTTCGC TCCAAATACA GCGTTGTTGG CAGTTTAATC GTTGGGATAC TGCCCAGCGG
CTTATGCGTG AGATTGAACA AATTAACCCT GACGCCGTGT TATTCAATTT GCAATTTGCA
ACCTTTGGCA ATCGCCGTAT TCCAGCAGCC TTAGCACTAT TAACGCCAGC CTTGGTTCGC
CAAAAAGGCA TTCCAAGTAT TGTTTTATTG CATAACATCA TGGATACAGT GGATCTTAAG
CAGGCTGGGT TTGCTGGCAA TCAATTAATC AATTGGCTTA CAAAATTCGC TGGGCGCATT
ATTACCCGTC AATTATTACG CGCTGATTTA GTTGCGGTAA CAATTCCACG CTATGTCGAA
ATCTTAGAAC AAACCTATGC AGCAACGAAT GTGCTTTTAG CTCCCCATGG AGCCTTTGAA
ACGCCGCCAC CTCCAACGGC TCCAAGCACC GATTGCTGGC GCTTTTTAGC TTTTGGAAAA
TTTGGTACTT ATAAAAAAGT TGAAACCCTG ATTGAAGCAT TTGCTCAAGT CCAAACCAAG
CATCAACAAC CCATGGAATT AGTGATTGCA GGCTCTGACA GCCCAAACGC AGCTGGTTAT
TTAAACACTG TTCAACAGCA ATATGCCCAT GTTCCTAATG TACTCTTTAC TGGCTATGTT
CCAGAAGAAG CCGTTGCAGG CTTGTTTCAA TCGGCAACAG CCGTTGTTTT TCCCTATACC
AGTACCACTG GAAGCTCGGG AGTGCTCCAT CAGGCAGGAT CATATGGTCG CGCCGCGATC
TTGCCAAACC TCGGCGATCT TGCCGATATT ATCAGCGAAG AGGGTTTTGA TGGCGTGTTT
TTTGAACCTG AAAACGTTAC CAGCCTCGCT TTAGCCCTTG AGCAATTGCT GCTTGACCCT
GCAAGATGCC ATGCGTTAGG GATGCAGAAC TACGCTGCTG CTTGTGGCTT ACCAATTAGT
GATGTTGCTG ATTGGTATCT TTTGCATCTG CAAACCTTGC TGCGTCAGCC AGCAATTGCA
CCAAGATTTC AACAGGAGGT TTTATGA
 
Protein sequence
MPNRGLRLVI VTTFPPSTGT LNEYAWHFVR AFQQKREIRE IIVLADELAR PEPELPNQIN 
GISLQIQRCW QFNRWDTAQR LMREIEQINP DAVLFNLQFA TFGNRRIPAA LALLTPALVR
QKGIPSIVLL HNIMDTVDLK QAGFAGNQLI NWLTKFAGRI ITRQLLRADL VAVTIPRYVE
ILEQTYAATN VLLAPHGAFE TPPPPTAPST DCWRFLAFGK FGTYKKVETL IEAFAQVQTK
HQQPMELVIA GSDSPNAAGY LNTVQQQYAH VPNVLFTGYV PEEAVAGLFQ SATAVVFPYT
STTGSSGVLH QAGSYGRAAI LPNLGDLADI ISEEGFDGVF FEPENVTSLA LALEQLLLDP
ARCHALGMQN YAAACGLPIS DVADWYLLHL QTLLRQPAIA PRFQQEVL