Gene Haur_4269 details


Gene Information

Locus tag: Haur_4269
Symbol:
ID: 5736128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5449262
End bp: 5451298
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 48%
IMG OID: 641281429
Product: hypothetical protein
Protein accession: YP_001547029
Protein GI: 159900782
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
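
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5449262
end_bp = 5451298
gene_length_bp = 2037
protein_length_aa = 678


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Both checks pass for this record: 5451298 - 5449262 + 1 = 2037 bp, and 2037 / 3 - 1 = 678 aa.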


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0936509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTGG TTGGTTCAAC GCTTTTTCGA GATCGATCAT CGATTTTTGC TGAGCCATGG 
CGGGTTTGGC TCGCTGCGGT TGGGGGCTTG GTGCTTTGGG CGGTATTGCT CTTAACCATT
ACTTATCAGT GGCAAGCTCA AGCCACAATT ACGCTTGGCG ATTATTATGA TCCGCCTTTT
CTCCAAGGCA ATTTCTCGCC TGGCGAGGTC AGTCAGGGTA ATAATCGTTC GTTTCGCTGG
ACAACTGGCG AGGCCACACT TGCCGTTCCT TTAGCTGGTC GCGGCAGTTG GACAACCAAA
ATCGATTTGT TAACTCAACA TCCTGATGCT AGTCCAGTTG AGGCAACCCT AAATTTTGCC
CCGAATGTGG CTGTGGCACT GCCAGATAGC AATGAGACGC GAATTATTCA TGCCTTTATT
CCAGCTACTG CAACTAGCAA TGGCAATCTG GCAATTGGGC TGAATTCAAA TCTATATCAG
GAGCAAACGG CTAGTGCCCG AACCTTGGGC GTAGCCTTAT TTAATGTTGA GCTTGCTTCA
ACGACAGGCC GCCCTTGGTT GCCACCATTG CTTGCAGTAG TTTTGCTCAG CATTATTTTG
CTTGGCATCG CCGCGAGCAT TCTGCTGACT GGCATCAATT GGACTTGGGC GGTTGGCCTT
AGCGCGGCTC TTGGCGTTGG TTTGGCGCTG CCAGTAATGT TGGCGCGTGT GCCACATACC
TTTTGGTTGC CAAATTTGGC GGTGCTGGCG GTTTTGAGCG TTGGCTTAGT TGCTGCGCTA
CGCAAAATTG TGCCGTGGCT GATGCACAAG GGTGGCATTG AATTAGAGCC ACGAACCTTG
ACGATTTTGC TTGGCTTGTT TTTATTTGGC TTTTGGGTCA AAGCTGCGGG CCAAGTCTAT
CCATATATGA TCGCGATCGA TATTCATTGG CATATGGAGC GAGTACGCTG GATTTTGGAT
GGTCGTTTGG CTGAAATGTA TAAGCCAGGT GCGTTCAATG AGTCAGTGAT GCCCGAAAAG
GAATTCGGCA AGGATCGGCC AATTATTCCA TATTCGCCGT TTTTCCATAT TTTCGCCACT
TCGTTTGCCT TGCTGCCCTT CCAAATGGAG ACAACGGCCA AGATATTTAG TTCAATCATT
GATTCGAGCT ATATCTTTTT GATTTATTTC TTTGCCCGTT CGTTTGATTT CTCGCGACGG
GTCGGGTTGT TGGCGGCAGC CTTTTATAAC GTAGTGCCAT TAACCTATTT GTTGCATTCA
TGGGGCAATG TGCCAACCAC CTTTGGTATG TGGTGGACAT TTATGAGCAT TGCTTTTGTG
ATTGGGGCTT GGGGCAAGCT TGATCAACGC AAGGTTTGGT GGAGTTTCGC CGCCTTGTTG
ACTGTGGCAT TTTTGATGTA CACGGTGATG GCGGTATTCT TGGGCTTGTT GCTGTGCATT
TGGATGAGCT ATATCGCCAT CACTAAGCGT AGCGAACGTC GCCAAGCCCG TTCAGTAATC
ACAGCAATGT TGGTGGGGGT GCTTGGTTCA ACGATCGTCT ATTATGGCTT GTATATTCCA
GGCATCATTG AAAAAACGAT TCCTTACTTC ACCACAACCT TTACCGAAGG CCAAGAAGCT
GTCGGCGCAA TTCAATATCA GCCCACAGCC TACGATAATT TCCTCGCGTA TCGCACCCGC
TTATGGAACC ATGGCTTGAT GATTCCCTTC CTGCTATTGC CATTTGCTTT GTGGGTGATC
GCTCGCTTGC GCACGCGCCA GCCCGAAACT CGCTTGTGGA TGGGATCGAT TTGGATGTGT
GCGATGGTAA CTGTTTCGTT ACTATTCACC GTGATTGATC GCAATGTGCC AATGGTTGAT
AAGCATATTA TTTTCTTGAT TCCAGTGTGG GCGATTTTGA TGGCAATGTT GATGGATATG
GCGCTCAAAC GTTGGCGCTG GAGCAGCATT CTGTTTGGCT TGGCCTACCT TGGGCTATTT
GCAATGTCGA TTGAATTATG GACACGGCGG ATCGCAACGG TGAAACAAAT ATGGTAG
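
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of any database tool) strip the whitespace and compute a GC percentage of the kind reported in the GC content field. Only the first line of the sequence is used here for brevity; pasting the full sequence into the same function would give the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGCAATTGG TTGGTTCAAC GCTTTTTCGA GATCGATCAT CGATTTTTGC TGAGCCATGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```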
 
Protein sequence
MQLVGSTLFR DRSSIFAEPW RVWLAAVGGL VLWAVLLLTI TYQWQAQATI TLGDYYDPPF 
LQGNFSPGEV SQGNNRSFRW TTGEATLAVP LAGRGSWTTK IDLLTQHPDA SPVEATLNFA
PNVAVALPDS NETRIIHAFI PATATSNGNL AIGLNSNLYQ EQTASARTLG VALFNVELAS
TTGRPWLPPL LAVVLLSIIL LGIAASILLT GINWTWAVGL SAALGVGLAL PVMLARVPHT
FWLPNLAVLA VLSVGLVAAL RKIVPWLMHK GGIELEPRTL TILLGLFLFG FWVKAAGQVY
PYMIAIDIHW HMERVRWILD GRLAEMYKPG AFNESVMPEK EFGKDRPIIP YSPFFHIFAT
SFALLPFQME TTAKIFSSII DSSYIFLIYF FARSFDFSRR VGLLAAAFYN VVPLTYLLHS
WGNVPTTFGM WWTFMSIAFV IGAWGKLDQR KVWWSFAALL TVAFLMYTVM AVFLGLLLCI
WMSYIAITKR SERRQARSVI TAMLVGVLGS TIVYYGLYIP GIIEKTIPYF TTTFTEGQEA
VGAIQYQPTA YDNFLAYRTR LWNHGLMIPF LLLPFALWVI ARLRTRQPET RLWMGSIWMC
AMVTVSLLFT VIDRNVPMVD KHIIFLIPVW AILMAMLMDM ALKRWRWSSI LFGLAYLGLF
AMSIELWTRR IATVKQIW
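
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a rough sketch, assuming the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons). This gene begins with ATG, so no special handling of alternative start codons is included. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering used for genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence shown above.
print(translate("ATGCAATTGG TTGGTTCAAC GCTTTTTCGA"))  # prints: MQLVGSTLFR
print(translate("ATCGCAACGG TGAAACAAAT ATGGTAG"))     # prints: IATVKQIW
```

The two outputs match the first block and the final residues of the protein sequence listed above.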