Gene Haur_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3866 
Symbol 
ID5735715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4856045 
End bp4858591 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content51% 
IMG OID641281017 
Productglycosyl transferase family protein 
Protein accessionYP_001546628 
Protein GI159900381 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4745] Predicted membrane-bound mannosyltransferase  
TIGRFAM ID[TIGR03663] conserved hypothetical protein TIGR03663 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000236131 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGCAA CATCTTCACG ACAGGTAGCC GTGCCACGCA CACAACGTCG CCGCTTCACG 
GTTGAACATC TGGCCTATAT TGGCCTTGGC TTGCTCTCGG TGCTGATGCA CGTGTGGGCG
CTTGGCGGTC GATCATTGCA CCACGATGAG ACGATTCATG CCTATTATTC GTGGTTGCTC
TATCGCGGCG ATGGCTATTT GCATGATCCG TTGACCCACG GGCCGTTTTT GTATTATTGG
ACAGCCTTGC AATACTTTTT GTTTGGCGAC AATGAGTTTA CGGCACGTTT GGCCGCTGCG
ACCTTTGGCA TTGCCTTAAC CCTCACGCCG TGGCTGTTGC GCAAAAATAT TGGGCGCGGC
ACAGCTTTGT TGATGGTGGG CTATCTGTTA ATTTCGCCAG TAACCTTGTA TGTCGGGCGT
TTTATTCGCC ACGATATTTT TGCGGTTACT CAAGAAATTA TCTGTCTGAT TGCGATCATT
CGCTATATTT CCACGCGCCA TGTGCGTTGG ATTTATATTT TCTTCGCTTC GTTTGCTTTA
ATGTTTGTAA CGATTGAGAC CTCGTATCTC TTTACCTTGA CCCTTGGTAG TTTTATTGTG
CTGGTCACCT TGTGGCAAGT CAATCGCAAG CTGTTGATTC TGTTTGGAGT CTATGGCTTA
TTGGCGGTCG CTTGTCTCAA AGGCATCCCT GATCATCGTG GCCTCTTGGT TACAGCTAAT
GGCAACCCCG CCCCAGTGCT CGATGCCAAT GGTCAAGAGC AATGGTTGCC GCTGCCGTTG
GTCACTGAAA GCCAAGCCTT GGTCGTGCGT AATCAAGGCG ATGATCTGTT TTTCGATGAC
ACCGCCAATG GTGGTTTCCG CCAAGGCTAC TTCTCGAAAT TGCGCGAAAC CTTGTTTGGC
ATTAGCGCTG CCGAGGTGCA ATCGAATCCA AGCCGCCTCT ATCACCAAGT TAAAAATCAA
ACGCTCTATG GCAACAACGG CGTGTTTATG CACATGCCAA TTACCGCCCT AACGCTGCTC
ACGCTGGTCT TTTTGATCGC GGTGATTGTG ATTATTTGGT TTTACAAGGG CAAAGATCAG
ACGCAAACTA TGTGGAAACG GGCGGTCAGC CAAGCTCCCG AGCGTAGTTT ATTGCCCGCT
CTAGATTCGT TGTGGAGTAT GCACGGGGCG CTGGCGGTGA TTTTGGGACT AGTGATTTAC
GCCGCTTTCT TCACCTCGTT TGGTGTGCAT CCGGTTGGCG TGGTTTCGGG GATTGCTGGT
TCGTTGCTGT ATTGGGTGGC CCAACACGAT GTTGAGCGCG GCGGTCAGCC CAGCCATTAT
TATTTTGTGC AATTGCTGGT GTACGAGCCA TTACTGCTGT TTGCTGGCTT TGCGGGAACC
TTGGCAGGCA TCGGCCACTT GGTGTTGCAA ATCAAACGAG GCGCTGCCTT GACGGCTCAG
CGCATGGCGC CTGGTTTGTT GGCGTGGTGG GCGGCTGGCT CGTTTGCTCT GTATAGCTGG
GCTGGCGAGA AAATGCCATG GATCACCTTG CACGTGGCCG TACCATTGAT TTTTATTGCA
GCTTGGGGCA TTGGGCGGAT TTTCACTTGG GGCTTTCAGC CAATTCAAAA AGCTTGGCTT
AGCAGCAAAA TCCCAACCCG CCGCGATGAA TGGCTTGGGC TATTCGGCTA TTTGGCGGTA
GTGGGTTTGG TTGCTAGCTA TGCACTCATG CAATTAGTGC GGATGATTCG CATCCAACCC
AATATTGCAC CAACTGGTAG CCCCGCAACG CCGATGTTCC TTGTGAGCAT GATCTTGATC
ATGCTAGCAA TTACCGGTTT TTATAGCTTG TTCCATGGCT GGCGGCGAGC GCTCACAGGC
TTGACCTTGG CCTTAACCAT TATGTGGAGT GCCTATTCAT TCCGCTCGGC TTGGCGCTTG
AACTACCAAA ATGGCGATAT TCCGGTAGAA ATGCTGGTGT ATGTGCAAAG TTCGCCCGAT
GTTGGCCGCG TGATGGACGA TTTGCGCAAA ATCTCGGTTG CTGAGACAGG CCGCATGGAA
TTGCCGATTA TGTATGATAA CGAGCAAATC TGGAAGTGGT ATGTGCGCGA ATATACCAAA
GCGGTTGGCT TCTCTGGCTC GATGAATAGC CCAGCCACCG CTGAAACAGC AGCAATTTTG
ATGATCGACA GCAATTGGTC AACCAACGAA GCCAATGTCC AAGGCTTCCT TGAAGGGCGC
TTCCCATTGC GTTGGTGGTT CCCTGAAAGC CAGTTTTATC GCTTTGCCGA AGTACCAGAA
CTTGATGCCA ATGGTCAAAC CATGCGCGAT AGTAGCGGTG AGCAAGTGAT GAAAGCTGCG
CCATACGAGC AAGATTCGAC GATTGGGCGA TTAATCCGCG ACCCATTTGA TGCTAAAACC
CAAAATGAGC TTTGGCGTTA CTTGCTGTTC CGTCAGCCGC CAGGCCAACT CTCATCGGTT
GATTTTAAAG TCTATGTTCG CCCGCGCTAT GCCCACGTGT TGGGCGTGCA AGCTCAAAAC
GTGAATGGCC AATCGCAGGT GCGTTAG
 
Protein sequence
MIATSSRQVA VPRTQRRRFT VEHLAYIGLG LLSVLMHVWA LGGRSLHHDE TIHAYYSWLL 
YRGDGYLHDP LTHGPFLYYW TALQYFLFGD NEFTARLAAA TFGIALTLTP WLLRKNIGRG
TALLMVGYLL ISPVTLYVGR FIRHDIFAVT QEIICLIAII RYISTRHVRW IYIFFASFAL
MFVTIETSYL FTLTLGSFIV LVTLWQVNRK LLILFGVYGL LAVACLKGIP DHRGLLVTAN
GNPAPVLDAN GQEQWLPLPL VTESQALVVR NQGDDLFFDD TANGGFRQGY FSKLRETLFG
ISAAEVQSNP SRLYHQVKNQ TLYGNNGVFM HMPITALTLL TLVFLIAVIV IIWFYKGKDQ
TQTMWKRAVS QAPERSLLPA LDSLWSMHGA LAVILGLVIY AAFFTSFGVH PVGVVSGIAG
SLLYWVAQHD VERGGQPSHY YFVQLLVYEP LLLFAGFAGT LAGIGHLVLQ IKRGAALTAQ
RMAPGLLAWW AAGSFALYSW AGEKMPWITL HVAVPLIFIA AWGIGRIFTW GFQPIQKAWL
SSKIPTRRDE WLGLFGYLAV VGLVASYALM QLVRMIRIQP NIAPTGSPAT PMFLVSMILI
MLAITGFYSL FHGWRRALTG LTLALTIMWS AYSFRSAWRL NYQNGDIPVE MLVYVQSSPD
VGRVMDDLRK ISVAETGRME LPIMYDNEQI WKWYVREYTK AVGFSGSMNS PATAETAAIL
MIDSNWSTNE ANVQGFLEGR FPLRWWFPES QFYRFAEVPE LDANGQTMRD SSGEQVMKAA
PYEQDSTIGR LIRDPFDAKT QNELWRYLLF RQPPGQLSSV DFKVYVRPRY AHVLGVQAQN
VNGQSQVR