Gene Haur_4328 details


Gene Information

Locus tag: Haur_4328
Symbol:
ID: 5736188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5527629
End bp: 5528774
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 49%
IMG OID: 641281489
Product: phosphatidylinositol alpha-mannosyltransferase
Protein accession: YP_001547088
Protein GI: 159900841
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
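
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the non-spliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5527629
end_bp = 5528774
gene_length_bp = 1146
protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1146 0 381
```

Under these assumptions the span equals the listed gene length, is a multiple of three, and yields the listed protein length.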


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00438832
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTG GCTTTTTGAT TGATGATCAT ATGCACCGCG CTGGGGGCAT TCAGGTATAT 
GTTCGTGGTT TGTATCATTA TTTTCAAAGC AAAGGTCATG AGGTGGTGAT TTTTGCTGGT
GGTAGTCAGT TTGATAATAG CAAATTGGCT GAGCGCGTTA TCTCGTTGGG CGTTTCAATT
CCGACGTATG GCAGCGGTTC GAGTACATCA CTGCCAATCT GCATCGAATC GAATCGCCGC
TTGCGCGAGA TTTTAGCCGA AGAAGCTTGT GATGTTTTGC ATGTTCAATC GTTGCACTCG
CCTACCTTGA GCGGGCGACT TTTGGCGAAT TCCAAGGCTT GCCATGTTTC AACCTTTCAT
ATTCGGGTTG ATGAGGCGTG GAAATTGCAG GCCTTGCGTT TGGCAACCGC ACTTGGCCCC
GATTTATATC GCCATATCCA TGGGCGAATT GCTGGATCAC AGGCGGCACT CGAAACAGCG
CAGGCAATTT TTGGCACCAA AGACGAGTAT ACAATTATTG CCAGTGGCAT AACGATTGAT
CGCTTTGATG CAGCGGTGAA TTTGCCACGC TTGCATCAAT ATAACGATGA TAAAATTACG
CTCTTTACGC TTGGGCGCTT GGAGCAACGC AAGGGCGTGG AGTATCTGCT GCGAGCTTAT
GCCTTGCTCC AAAAGGATTA TCCCAACCAA TTGCGCTTGG TGATTGCTGG CGATGGGCCA
TTACGCGAAG AATTACAAGC CTTGGCAGCC CAATTGCGCT TGACCGATGT CGAATGGCTG
GGCTATGTGA CCGATTTGGC CTTGCCGCAT TTGATGGCGA GTGCTGATAT TTTTTGTGCA
CCAGCGATTG GCCAAGAGAG CTTTGGTTAT GTATTGATTG AGGCCATGGC GGTTGGTTTG
CCAGTTGTGG CGGCAGCTAA TGCAGGCTAT GCGGGAGTTT TGGCCAATCA TCCAGGTAAT
TTAGCAGTGC CACCACGCGA TCCACGGGCA ATGGCTGGGG CGATTGCCAG TTTTGTTGCC
AGCCCTGCCG CCCGCAAACG TCTGCGCCAA CTAAATCTAC AAGCGGCCAA AGGCTATAGT
TGGCAGGTGA TTGGCGACCA AATTATGGAA TTCTACAAAA AAACAATGGC GCAAACTGTG
CAATAG
 
Protein sequence
MKIGFLIDDH MHRAGGIQVY VRGLYHYFQS KGHEVVIFAG GSQFDNSKLA ERVISLGVSI 
PTYGSGSSTS LPICIESNRR LREILAEEAC DVLHVQSLHS PTLSGRLLAN SKACHVSTFH
IRVDEAWKLQ ALRLATALGP DLYRHIHGRI AGSQAALETA QAIFGTKDEY TIIASGITID
RFDAAVNLPR LHQYNDDKIT LFTLGRLEQR KGVEYLLRAY ALLQKDYPNQ LRLVIAGDGP
LREELQALAA QLRLTDVEWL GYVTDLALPH LMASADIFCA PAIGQESFGY VLIEAMAVGL
PVVAAANAGY AGVLANHPGN LAVPPRDPRA MAGAIASFVA SPAARKRLRQ LNLQAAKGYS
WQVIGDQIME FYKKTMAQTV Q
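
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch in Python of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the record or from any library.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGAAAATTG GCTTTTTGAT TGATGATCAT ATGCACCGCG CTGGGGGCAT TCAGGTATAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the Gene Length and GC content fields of the record.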