Gene Haur_3709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3709 
Symbol 
ID5735573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4664340 
End bp4665710 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content52% 
IMG OID641280861 
Producthypothetical protein 
Protein accessionYP_001546473 
Protein GI159900226 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0516593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACAA TTATGTGGAT TAAGAATCAT CGTTGGCTGA GTTCGTTATT GGTAGCCGTG 
GTTTGCCTTG GTTTAGGCGG GCTGGCAGGT TTGATGCCGT TGCGCTATTT GGCAGGCGTA
GCCTTTTGCC TTGGCCTAAC CGCAATTGGT TATGCTTGGG TTTGGCTAAT TAGCAAGCCC
GACGAGTTGC GGGGTATGCT GCTGCTCTGT TTTGCAGCGA TGGGCCTGCG TTGGGGAGCC
AGCTTTGCGC TCGAATTTCT CTGGCCCAGT TTTGAATCGC TGAGTGATGG CGCAGCGTAT
GGCCCGCATG CCATGACGAT TGCCCAAGCT TGGAACGCTA ATTATTTCGC TAGTTACGAG
GCGGTGGTTT CAACCCCGGT TGGTGCGCCG GGCTATGTTT ATTTTTCGGC AGTAATTTTT
TGGTTATTTG GGCCAAATAC CTTGCTGGTA AAATTGGCTA ATGGCTTGTT TGCTGGCATG
GCGGCGGTGT ACACGGCCAA ATTAGGCAAC CATTTTTTCG ATCAACGGGT TGGGCGGTTT
GCCGCGTTGT GGATGTTAAT TATGCCATCG CTGATTTTGT GGACTTCGCA AAATCTCAAA
GATAGCGCGG TGGTGCTGCT CTCAGTCTGG ATTTTGTATG TAGCCAGTCA AGGTTTGCGC
TCGTCGTTAT GGCAAATTCC ATTGTTGGTG CTGCTGATTG GGGCGTTGAT GAGCGTGCGG
CGCGAAACCT CAATTGGCAT TGCTTTGATG ATTGCCTTGA CAATTGGCTT TCAGCAAACC
CGTCATTGGC TAACCCGCCT GAGCTTGAGC GCGATCACGA TTGTGGCCTT GGGCTTGGTG
CTTTCGAGCA GCGGCTATGG CTTTTTGGGC AGCGATTATC TGCAAGAGCG GCTTTCGCTC
AGCGCAATTA GCGAAAAACG CGAGGCCAAC TCAACCGGCA CAGGCACAAT TGAAAATACG
ATTGATACGA CCACGCCGCT GGGTTTTGCC CGCTACTTGC CAATTGCCTT GATTAATTTC
TGGTTGCGAC CGTGGCCGTG GGAAGCTACC AAGAGCACCG CCCAATTGCT GACCATTCCC
GAAGCGGCGC TGTTGTGGTA TCCGTTGTGG GTTTTGGCGA TGATTGGCAT GATCTTGGCG
TGGCGTAGCC GTTGGCGCGA AACAATGTTG CTCTGGCTCT ATCTGCTGGC TGGCAGTGCG
GCGGCGGCTC CGCAGTATGG CAATTTTGGC ACGGCCTATC GCCATCGGGT GCAGCTGTGG
CCAATTTTCT TTTTGTTTGC AGGCTATTGT TGGTATCGTT GGCGCGATGC TAGAGTTGAG
CAACGTCAGG CGTTGTTGAC CCGCTATGTG CAGAGTATCA AGCAAATTTA G
 
Protein sequence
MRTIMWIKNH RWLSSLLVAV VCLGLGGLAG LMPLRYLAGV AFCLGLTAIG YAWVWLISKP 
DELRGMLLLC FAAMGLRWGA SFALEFLWPS FESLSDGAAY GPHAMTIAQA WNANYFASYE
AVVSTPVGAP GYVYFSAVIF WLFGPNTLLV KLANGLFAGM AAVYTAKLGN HFFDQRVGRF
AALWMLIMPS LILWTSQNLK DSAVVLLSVW ILYVASQGLR SSLWQIPLLV LLIGALMSVR
RETSIGIALM IALTIGFQQT RHWLTRLSLS AITIVALGLV LSSSGYGFLG SDYLQERLSL
SAISEKREAN STGTGTIENT IDTTTPLGFA RYLPIALINF WLRPWPWEAT KSTAQLLTIP
EAALLWYPLW VLAMIGMILA WRSRWRETML LWLYLLAGSA AAAPQYGNFG TAYRHRVQLW
PIFFLFAGYC WYRWRDARVE QRQALLTRYV QSIKQI