Gene Haur_3825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3825 
Symbol 
ID5735689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4800732 
End bp4803611 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content53% 
IMG OID641280977 
Producthypothetical protein 
Protein accessionYP_001546589 
Protein GI159900342 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTGGT ATAATGGCAC GTTGTTGTCG ATGAAACGAT TTGAGGCTGG CTTGGTACAA 
GACTGGTGGC AACGCCGCGC TGCATGGTCG CTGCGTCAAT GGATCATTAC AATCGCAAGT
GTGTTGCTGA TGGCGCTATG CTTATTAACC GCGCTCTATC AGTTGCCGAC TCATGTGCGG
ATCGCCAGCG ACGGCTTGGG TGATCAGCCG TTTTTGGTTT CAAGCGAGGC ACTAGGCCAA
GCTGCCACCG ATCTTGGCAC GGTCTTCCCC GATGAACTCG ACGCAACTGG CGGGCGGTTT
CGCTGGACAC GCGGCAGCAC CCAAATTCAG ATTCCGGCAC TCGGCGCACA TAGCAGTTAT
AGGCTTGAGC TTGATGTAGT TGGCTGGCCC GACGATGTGC TACGCAGCGA TGTACGCCAG
CCATTTGTGC ATGTGTTGGT CAATCAACAG CTGATCGCCG TATTTGAGGC CAGCAGCCAA
CGCACCATCT ATCAATTAAA TACTCCAGCA CTCAATCAAA GCAACCTTGA GCTTGAATTA
CGCCTGAGCC ACGACCCACA GCAGCTTGCC CCCTTGGATA ATACTGCCAC CTTCACTGGC
ACGACGCTCT ATCCCAACGA CCGCCGCCCC CATGGTTTGC GCTTGTATGG CCTTAGCGCC
ACGACCAACA CTGTAGGCTT TAATTTGCCT GCCTGGAGCT TGATTTGGCG TGTAGCAGTG
GCCTTGGGCT TGATCCTCCT GGCTGGTTGG CTCTATCCGC GTGAGCGTTT GCCATTGTTT
TTGCTTGGGT TGCTCTTGGT CGTTTTTTGG CTGACCCTGG GCTATCTTGC CCGCCACTGG
CTGTTGCCAT CGTTTGAGTT GGTGCTGTTG GGGCTTGGTT TTGTGGTGGT TTGGAATTGG
CAACGCCAGA TTATTGATAC CTGGTTGCAT TTCCGCCAGC GCTTGTTGCA AGGCCAAAAT
CTTGATTATG GTTTAGCTGT GGCTGCACTG ATTGGCCTGC TGGCTATCAG TTGGCCCAGC
CTTAGCCAAG CCGCCAACGA ATGGCTACCC AAAGCCAAAA AGCTCGATCC TGGGGCGATT
ATTGTGGTTT CGTTGGCAGT AGCTAGCTTG TTTATTGCAG GCTTCTACTG GGGCGATCTC
AATCGGTTGC TCGATCGGCT CAATCGGCGT TTGCGCACCC GCCAAGGCTT GGGCTGGGCC
TTGCTAGGCT TAGTGGTTGG GGTTTGGTTA GTCTATAGCT GGGGCGTAAT TCGCCAAATT
AACTACCTTG GCAACGCCGA TTACTCCGAT AACGGTGTAG TTGCCCGCAA CTTGGTGGCG
GGGCGTGGCT GGGTCGTTGA TTATGTCACC CAATTTTTCA AACTGTATCC CGATGGCAGC
GTGACCCGCG TGCAAGAAAC CTGGCCAATG TTGCAGCCAG TCTGGATTGC GCCATTCTTT
GCCTTGTTTG GGCCAGAACC GTGGGCCGCC AAAATCCCCA ACCTGATCTT TTTTAGTTTG
CTGTCGATCG TGGTCTATCG TATTGCCCGT CAGTTGTGGG ATCAACGGGT TGGTTTGATT
GCGGTGCTGT TGGTGCTGAT CAATCGACAT ATGTTCCGCT TGATGATCTA TAGCACCTCG
GATTTGGCGT TTGTGCTGTT CTACACAGCG GCGATTTGGT TGCTCTGGCG TAGTTTGGTA
ACTCATAGCA AAGAGCGCTT GCTTGGCTCA GGCCTGATCA TCGGCCTCAT GTGTTGGCAA
AAAACCAGCG CCGTAATTGT AGCGATTGGC ATGGGCTTGT GGCTGATTTG GCGTTTGTGG
CAGGTCGAAG ATCGCTGGAA AGCCTATCGC ACTGCTGCTT TGTGGTGGGT ACTCCCAGCG
GTTTTGGTGT TCTCACCTTA TATTGCCCGC AACCTGCACG AATTTGGCAA ACCCGCCTTT
TCGACCGAAA GTTATGATGC TTGGATCATC GGCTACACAA ATTTCGATTC GATCTACAAT
ATCTACACCA ACGAGCAAGG CTTGCCCGGG AGCAACGGCT TACCTGAACC AAGCTGGATT
CTGCGTTGGG GCTATCAAAA GACCTTTGAT AAAATTGGTA ACCAATTTGA AGCGACTCGC
AACTATTTGT TGCCAGCTAG CCCAGCTTTG GGCATGTTCA GCGGTCGCGG CAATTTGATG
GGCAATAACG ATCAGGGCGT TAATGGTCAG CCCTATGTGT GGTTGATGTT GGGTGCGTGG
CTAAGCTTAA TTGGTTTGAT TGCTGCTCGC CGCCAACAAG CCAGCTTGAT CGCCTTGGTT
GGAGCGAGTT TCACACCATA TATTATCTTC TTGGCGCTCT ATTGGCACGC TGATGAAGAA
CGCTATTTTG TGCCATTAGT GCCATTTTTG GCCTTACTGG CGGCTGGGGC GTTGGTGGCA
ATTCACGATG CAATTGCTCG TTGCTGGAAC CAGCGTGGTC GTCCGTTGGC CTTGTTGGTA
GCGGCTCAAT TGCTGGTCTT GGCGCTCACC CCGGGGTGGG TTGAGGCCGC TAACAAAAGT
GCTACCGTCG CAGGCAGCGA GTATGCCGAA TGGCAACCCG ATTTGCAGGC CTTTGAATGG
TTGCGCCAAA ACACACCACC AGAGGCGGTG GTTATGACCC GCGTGCCATG GCAACTCAAT
TTTCATGCTG AACGCGCTGC TGTGATGAAC CCAAATGTGG CCGATTTAGC CGTGATTAAA
CAGGTTGCTG ACTATTACAA GGCCTCGTTT ATTCTGGTCA ATGCTGTGCA AAACAACAAA
GATCAAGCTC AAATTGGCTT AGGCCGCTTG CTCAAAGGCG AGGAGCTACC TGGGTTTGTG
CTACGAGCCA GCTTTGCGGG GCCGAAAAAC CGCACAGTCT ATATCTACGA AATTCAATAA
 
Protein sequence
MLWYNGTLLS MKRFEAGLVQ DWWQRRAAWS LRQWIITIAS VLLMALCLLT ALYQLPTHVR 
IASDGLGDQP FLVSSEALGQ AATDLGTVFP DELDATGGRF RWTRGSTQIQ IPALGAHSSY
RLELDVVGWP DDVLRSDVRQ PFVHVLVNQQ LIAVFEASSQ RTIYQLNTPA LNQSNLELEL
RLSHDPQQLA PLDNTATFTG TTLYPNDRRP HGLRLYGLSA TTNTVGFNLP AWSLIWRVAV
ALGLILLAGW LYPRERLPLF LLGLLLVVFW LTLGYLARHW LLPSFELVLL GLGFVVVWNW
QRQIIDTWLH FRQRLLQGQN LDYGLAVAAL IGLLAISWPS LSQAANEWLP KAKKLDPGAI
IVVSLAVASL FIAGFYWGDL NRLLDRLNRR LRTRQGLGWA LLGLVVGVWL VYSWGVIRQI
NYLGNADYSD NGVVARNLVA GRGWVVDYVT QFFKLYPDGS VTRVQETWPM LQPVWIAPFF
ALFGPEPWAA KIPNLIFFSL LSIVVYRIAR QLWDQRVGLI AVLLVLINRH MFRLMIYSTS
DLAFVLFYTA AIWLLWRSLV THSKERLLGS GLIIGLMCWQ KTSAVIVAIG MGLWLIWRLW
QVEDRWKAYR TAALWWVLPA VLVFSPYIAR NLHEFGKPAF STESYDAWII GYTNFDSIYN
IYTNEQGLPG SNGLPEPSWI LRWGYQKTFD KIGNQFEATR NYLLPASPAL GMFSGRGNLM
GNNDQGVNGQ PYVWLMLGAW LSLIGLIAAR RQQASLIALV GASFTPYIIF LALYWHADEE
RYFVPLVPFL ALLAAGALVA IHDAIARCWN QRGRPLALLV AAQLLVLALT PGWVEAANKS
ATVAGSEYAE WQPDLQAFEW LRQNTPPEAV VMTRVPWQLN FHAERAAVMN PNVADLAVIK
QVADYYKASF ILVNAVQNNK DQAQIGLGRL LKGEELPGFV LRASFAGPKN RTVYIYEIQ