Gene Haur_3431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3431 
Symbol 
ID5735292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4320066 
End bp4321205 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content54% 
IMG OID641280578 
Productglycosyl transferase family protein 
Protein accessionYP_001546195 
Protein GI159899948 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.749704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTT TACTACGTTC GACAATGGTT CGAGGTTTGG CGCTGGCGTT GGCTGGTTTT 
ACCCGCCCAG CACCAGCCCA ACCTCAACGA ATTTTGGTGA TTAAGCCCGA TCATTTGGGC
GATGTTTTAC TGTTGACCCC AGCGCTACGG GCCTTGCGCT ATAGCCAACC GCACGCCCAG
ATCAGCGTTT TGGTTGGTTC ATGGGCTACC CGCCTCTTGG CCGACAATCC TGATCTTGAT
GCGATTGAAA CCTGTGAGTT TCCTGGTTTT GTGCGCGGTG CGCAACCCTC GACCTTGGCT
CCTTATCGCT TGCTTTGGCG CGAAGCTGCC CGTTTACGAA GCATGAATTT TGATACAGCC
TTGATTGCCC GTGATGATCA TTGGTGGGGC GGCTTGTTGG CGCTTGGGGC TGGCTGTGGC
CGCCGAATTG GTTTTGCCCA TCCCTTGGTT GCGCCAACCT TAACCAAAGC TCTAGCGTGG
AATAGCAATG AGCATGTTAC CAAGCAAGCC TTGGATTTGG TGGCAGCGCT CGGTTCAGAT
CAACCACAAA CCCAAACGTT GCGGTTTATG CCAAGTGCCG CTGAGCATGC TTGGGCTGAG
GCTTGGCTGG CGCAGCATCA GATTCAAAAA CCATTGGTGG CAATTCAGGC TGGCAGTGGC
GGCGCAGCTA AATTATGGCC GGCTGAGCGT TGGGCACAGG TCGCTGAACA ATTGGCAAAT
CAAGCCCAAA TTGTATTAAC TGGTGGGCCA GCCGATGCTG TTGATGTGGC AGCAATCAGC
CAACAGTTGC AAATTCCCCA TTTGAATGCA GTTGGTCAGG CCAATTTGGG CCAATTGGCG
GCCTTGTTTG GGCGTTGCGC TTTGGTGTTG GGCGTGGATA ACGGCCCTTT GCATTTGGCC
GTGAGTCAAT CAACCCCAAC CATTCATCTG TTTGGGCCAG GCGATAAGCG GCGTTTTGGG
CCTTGGGGCG ACCCAACCCG CCACGTTGTG ATCGATGCCG AATTAGCCTG CTCGCCATGT
GGTGTGTTGA CCCATTGCCC ACGCCAAACC AAACCCAGCG AGTGCATGAC CGCAATTTCC
GTCCAGCACG TGATCGGTCA CGCCAAACGT CTGCTTGATC AGGCTGGAAC CTCAATTTAG
 
Protein sequence
MKALLRSTMV RGLALALAGF TRPAPAQPQR ILVIKPDHLG DVLLLTPALR ALRYSQPHAQ 
ISVLVGSWAT RLLADNPDLD AIETCEFPGF VRGAQPSTLA PYRLLWREAA RLRSMNFDTA
LIARDDHWWG GLLALGAGCG RRIGFAHPLV APTLTKALAW NSNEHVTKQA LDLVAALGSD
QPQTQTLRFM PSAAEHAWAE AWLAQHQIQK PLVAIQAGSG GAAKLWPAER WAQVAEQLAN
QAQIVLTGGP ADAVDVAAIS QQLQIPHLNA VGQANLGQLA ALFGRCALVL GVDNGPLHLA
VSQSTPTIHL FGPGDKRRFG PWGDPTRHVV IDAELACSPC GVLTHCPRQT KPSECMTAIS
VQHVIGHAKR LLDQAGTSI