Gene Haur_4279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4279 
Symbol 
ID5736138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5463593 
End bp5464834 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content52% 
IMG OID641281439 
Productglycosyl transferase group 1 
Protein accessionYP_001547039 
Protein GI159900792 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00024185 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATGC TCTACTTCAC CACTGCCTAC AACGCTGCCT TGCTTGATCG GGTGCATGAA 
GAATTTTTGT TGCGCTGGCA GGCGCTTGGC CATGAAACCA GCATTCTTGT GCCCGACTCC
AGCCGCGACC GCCAAAGCCG TTGGTTGGTC GAAGATGGTG CAATTCAGGT CTATCGGCCA
GCAGTGAGCA TGCAGCGCAG CGATCGGCTA TTGAATAATG TAGGCCGACG CTTGACTGAA
TATAGCTATT TTTTCAGTTT GCTGCGCGAT TATTTAAGCT TTTTGCGCCA ACACCCTGAG
ATCGAGATTA TTCACGTTGA GTCGGTTTAT CCATTGGGGG CGATTGCCGC CGTGGCTTCG
TTGATTGATC GACGGCCATT TGTACCAACC ATCCGTGGTG GCGACTTAAT TGCTGATGAT
TCGATCAGCT ATGGTTTTGC TCGCTACAAA CGAGTCCGTG CTTTGCTCAA ATTGACCTTT
GCGCGAGCTG CGGCAATTCG TTCGGTTTCA CCAAGTGCCA GTGCGATGGC CGAGCAATTT
GGCTGCCCAA CGCAGAAAAT CATCACGATT GGTCGGAATA TTCGCGACGA ATATTTCGAG
CGCGATCAAG CGGCCTTTCG GGCAGAAAGT CGAGCTTGGT TGCGCCAAAC CTACCCTGCA
ATTGCTGGGC GCAACGTGAT CGTCGCGGCA GGACGTTTAT TGCCAGTCAA AGGCTTTGAT
GATTTGATTC AGGCCTTGGT GGGTTTACCA CAGGCTGTAG CACTGATTTG CGGGCCAAAT
CGAGTTGACG AAAAACTTGG CGATTATGGC GAATATTTAG GCCAATTGGC CCATCGCCAT
AGCGTGGCCG ATCGAGTAAT CTTTACAGGA GGCATTCCCC GTGAGCAAAT GCCGCAGTAT
TTTGCTGGAG CCGACGTGCT AGCCGTACCT TCGATTATTG AAGGCGGCAA CCGCACCGTT
TTAGAAGCAG CAAGCTTGGG AGTGCCCTTC GTGGCGACTC GCAGCGCAGG CACACCCGAA
TTTTTTAGTG CTGCTGCCGG CATTAGCATC GCGCCACATC GGCCTGATCA ACTCTGGGCT
GGCTTAGCCA CAATTTTAGC TGAAACCTCG GAACAAGCCC AGGCTCGCAG CCAAACCTGT
CAACAAGAAG CCCAACAATT TTATTCACCT CAAGTCGCCC AGCGGCTCGC TCGGCTTTAT
ACAGCAATTT TAGCCAAGCA GCCGCTATCA GGGAACTTTT AG
 
Protein sequence
MRMLYFTTAY NAALLDRVHE EFLLRWQALG HETSILVPDS SRDRQSRWLV EDGAIQVYRP 
AVSMQRSDRL LNNVGRRLTE YSYFFSLLRD YLSFLRQHPE IEIIHVESVY PLGAIAAVAS
LIDRRPFVPT IRGGDLIADD SISYGFARYK RVRALLKLTF ARAAAIRSVS PSASAMAEQF
GCPTQKIITI GRNIRDEYFE RDQAAFRAES RAWLRQTYPA IAGRNVIVAA GRLLPVKGFD
DLIQALVGLP QAVALICGPN RVDEKLGDYG EYLGQLAHRH SVADRVIFTG GIPREQMPQY
FAGADVLAVP SIIEGGNRTV LEAASLGVPF VATRSAGTPE FFSAAAGISI APHRPDQLWA
GLATILAETS EQAQARSQTC QQEAQQFYSP QVAQRLARLY TAILAKQPLS GNF