Gene Haur_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1122 
Symbol 
ID5733014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1284694 
End bp1286736 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content51% 
IMG OID641278261 
Producthypothetical protein 
Protein accessionYP_001543898 
Protein GI159897651 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACGAG GTGTCTTCCC AAGCCAGCGC CTAGGCTGGC TTTGGCTTTT AGGCTTGCTT 
GGAACAGCCC TAGCGCTCAT GCTCAAGGTT CAAGCCCAAC TTGAGTTCCG CCATTTCAAT
GGGGCAGTTA TGCTTGGGGC AGCGGTTGCG CTCGGCATTT GGTGGTGGCT GATCAAGCGC
CAACCGCCAC AAGCTGATAT CTTGCCTGAA CTCCATGATC AGCCGTTAGA TCACCTAACG
TTGCGGCTTG GCATAAGCGC TGTATCCTTG CTTAGCGGTT TCGTCGCATG GAATAATCTC
TTTGGCAATG AATTTAACTC ACTCTCGACC TGGACATGGC TGTTTGCGAT TGGCTTATGG
CTGTTGGCAT GGATGCCCTG GCAACGCCCA CGCTTGCCCA AAGCTGATCG AGCCGAAACC
CAACGGGCAT GGCTGACCTT CGGCTGTTTG CTGATTGTCA CCGCTTTTGG TTTATGGATT
CGGCTGTATC GGCTGGATCA AATGCCATAT GACATGACGG TTGACCACGG CTGGAAGATG
GAAGATGTTT ACACCATTTT GCAGGGTGGT CGCCCATTAT TTTTGCCCAA TAACACAGGT
CGCGAGCCAG GCCAATTCTA TTACATTGCC ATGTTGATTC GCTTTTTTGG CGTGCCGTTT
GGTTTTATCG CGCTCAAACT TGGCAATGTG ATTATCGGCA CGCTAACAAT TCCGTTTATC
TATTTGTTTG CGCGTGAGCT GGGCGGTCGC AAGTTGGGCA TTTTGAGTGC CGCCTTGTAT
GCGCTAGGCA AATGGCCGCT CGAAACCACT CGCATGGGGT TGCGCTTTCC TTATGCCACC
TTGCCCGCCG CGCTGGTGTT GTGGTCACTC TGGCGCTATG TGCGCTTGGG CAAACGCAGC
GATGCCTTGC TGGTCGGGTT ATGGATGGGC ATGGGTTTAT ATGGCTATAT CGGCGTTCGA
GCAGTGCCCT TCGTCATTGC CGCCGTATTT GGCCTGATGC TGTTTGAGCG ACGACGACGC
AACCCCAAAG GCTGGCTCAA ATTGCTCGGC CATGGCAGCC TGACGTTAAT CACAACCGCC
TTGATTTTCT TGCCGCTCGG CCATTTTATG CTCGATTACC CCGATGTTTT TTGGTTTCGC
GTCAGCACCC GCACCAGCAA CCATACCGAC GATATTAGCC GCGAATTTTG CCAAACTACC
AGCAGCGAAC GTGAATGCGA TATTAAAAAA TTCGTTGCCA ATAATGTTAA TTTAGCGGTC
GCGTTTAACT GGCGTGGCGA TCGCAACGAA GTTAATAATG TGCGTTTCGA TCCATTGCTT
GATGTTGTCA GCGCGGCTTT GCTCTTGTTA AGTTTGCCAA TCGTCGTGTG GCGGTTGTTA
GTTGAACGTT CATGGCGTTG GTGGATGCTG GTCGTCGCAT TGCCATTACT TGGTTTAGCC
ACAACGTTGA GCTTAGCATA CCCAATCGAA AACCCCAGCG CAGCGCGGAC TGGCGTGCTG
ATGCCAGTGA TTTTTACCAT GGCCGCTGCA CCGCTAGCCT TGGCGCTTGA ATGGCTAACC
AAAGGCCAGC CGTTTGAGCA ATGGTGGCGC GGCAAATCAG CCTTGGGTGG ATTGCTGAGC
ATTGGCTTAA CAATCTGGCT ACTGGCATGG GCTGGACGCG AAAATTTCCA GCGCTACTTT
GTTGATATGG CCCGCCAATA CACCGGTTTT ATCCCGAATA ATCGTGAGGT TGCCGATGCG
ATTCGCTACT ATCGCGATGC TCAAGGCGTA CCCTACGAAA ACGCCTACCT CATGCTCAAC
AGCTATTTCT GGAAGGAATC GCGCAATATC AGCGTGCATC TAAATGATAT GCAATGGTAT
GTCAACAACA CGATTAAGCC CGAAATGGCG CTGGTTGTGC CTGGCAATCG CCCATTAATT
TACATTCTCA ATCCTGACGA TCAAGCCCAT ATTGATCAAT TGCAACAGGA ATATCCTAAA
GGCGAGTTGC GCCGCATCAG CAGCGCCGTT GGCAAGGATT TTCTGGTGTT TCATTTACGC
TAA
 
Protein sequence
MLRGVFPSQR LGWLWLLGLL GTALALMLKV QAQLEFRHFN GAVMLGAAVA LGIWWWLIKR 
QPPQADILPE LHDQPLDHLT LRLGISAVSL LSGFVAWNNL FGNEFNSLST WTWLFAIGLW
LLAWMPWQRP RLPKADRAET QRAWLTFGCL LIVTAFGLWI RLYRLDQMPY DMTVDHGWKM
EDVYTILQGG RPLFLPNNTG REPGQFYYIA MLIRFFGVPF GFIALKLGNV IIGTLTIPFI
YLFARELGGR KLGILSAALY ALGKWPLETT RMGLRFPYAT LPAALVLWSL WRYVRLGKRS
DALLVGLWMG MGLYGYIGVR AVPFVIAAVF GLMLFERRRR NPKGWLKLLG HGSLTLITTA
LIFLPLGHFM LDYPDVFWFR VSTRTSNHTD DISREFCQTT SSERECDIKK FVANNVNLAV
AFNWRGDRNE VNNVRFDPLL DVVSAALLLL SLPIVVWRLL VERSWRWWML VVALPLLGLA
TTLSLAYPIE NPSAARTGVL MPVIFTMAAA PLALALEWLT KGQPFEQWWR GKSALGGLLS
IGLTIWLLAW AGRENFQRYF VDMARQYTGF IPNNREVADA IRYYRDAQGV PYENAYLMLN
SYFWKESRNI SVHLNDMQWY VNNTIKPEMA LVVPGNRPLI YILNPDDQAH IDQLQQEYPK
GELRRISSAV GKDFLVFHLR