Gene Haur_4225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4225 
Symbol 
ID5736079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5385021 
End bp5387225 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content52% 
IMG OID641281380 
Producthypothetical protein 
Protein accessionYP_001546985 
Protein GI159900738 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0726611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAC GATACACGCT GCATACCTGC ATCATTTTGG CGGTTGCTTT GGGCCTGCGC 
CTCGCCACAT GGTGGCTTTT GCCCTATAAC GATTGGATTA GTGACGAAGG CGAATATTGG
GGTGCTGCTA TCTGGCTAGC GCAAGGCCGC GAATTTCAGT TTTTCGATAG CTGGATCTGG
ACACGCCCGC CGCTGTATAT TAGTTTTTTG GCGGCGCATA TCAAGCTTTT TGGCAATAGC
GCGTTGTGGG CTCCCCGCCT GAGCCAAGCC TTGATCAGCG TTTTGAGCGT TTGGCTAACC
ATGCGGATTG CCAAGCGGCT CACTCCAACT GAACACCAAG CACGGGTTAG CTTAATTGCT
GGCTGGCTGA TGGCGCTGGG CTATTCATTT GCTGCATTCA GCTATTTTAT GCTTTCAGAA
ACGTTGTTTT TGAGCATCTT CTTGGCGGCG AATTTATTGT TGTTGCGCTG GGCTAGCACC
CGCCATTGGC GCGATTTGCT CTTGGCTGGG GTAGGCTTTG GCTTTGCTGC CCTGACCCGA
GCGATCATTT TAACATGGCT GCCGTTGCCC GCCTTGTGGA TTGCTTGGCA AATTTGGCGC
ACTCAGCGCC CACGCTGGCA AGCCATGATT AAACCAGTGC TTGGCTTTAC GCTCAGCGTT
TGTGTAATTG TCTTGCCGTG GACAGCCTTT GCAACCAATC GTTGGAGCAA CGGCGATGGC
TTAATTTTGG TCGATACGAC TGGTGGGTAT AATTTTGCCC TCGGTGCGCA AATTGCCACG
CCTGATGGCC GTAATGGCAC CCGCCTAGCC GAGATTTTGT GTGGTGGCAA TGGCTTAGTT
TGCCAAGGCT CGCAGGCAGC ACGGCAAAAT CAAGCGTATG CCCAAGGGTT TGAGTGGCTA
GGCGAAAATC CACAGCGCTT TATCAGCAAA ACGGCGCTTG AATTATTAGA TATTTTACAA
GTGCGCTTTG ATAGCGCTGA ACATTTGACC GATGGCTATG TTGATGGCCG CGTGCCAGTG
CCCCATCTCT TGGGCTTGTT GCTCGACGAC ACGCTGTATG TAGTGCTTGT AGGCTTGGCG
GTGCTTGGGT TTTGGCGCAA GCAAGCGGTT GCTGGCAAAG GCTTGGTGCT TGGTTGGTTG
GGCTATAACA TTATTGTTGG CTCGTTGATT TTTGCAATTG CCCGTTTTCG TCAACCGCTG
ATTCCCTTCG TGATCATCTA TGCCGCCTTG GCAATCGTGC AGTGGTCGCA AGCTTGGGCC
AGCAGCCGTC AGCAGCGTTA TGCTTGGGCT AGTGCTTGCT TGTTGTGGCT GATTGTGTTG
CCATCGTATC TGTATTTACC AGAATCGGTC GGGGTACGCA GCGTTTGGCA AGATGTGCGT
TTGGGCTTTG CTGGCGTGCA ACAAGCCAAC CAATGCCAGG CAATTCGTGA GCTATTGCAA
GCTGGCGATG TAGTGGCAGC TCGCCAACTG CACGACCGCA TTGATGCCGA GGGTCGCAGC
GAAAATACTA GCGTTAGTGG CTATACTGGG CGGCGCTGCT TGGCCTTGAT CAACGGCCAA
TTGCTCGAAG CCGAAGCTAA ACCAGAGCAA GCCTTGGCCT TCTACCAACA GGCTAATCCC
AAAAATAATC CGGTGCAATC GGCCCGCATT TTGATGCTCG AAGGCAATTT GCTGCAACGC
CAAGGCCAAC TTAGCGCAGC AGTCGCCCGT TTTAATTTCC GCGATGTCGA AATTATTAAT
GATCTGGCTT GGGCGTGGGA TTATTTGACC GTTGTGCCAA CCACCACCAT CGATCTTGGC
TCTGGCTTGG ATTATGGCTA TGTGCGCGGA TTTTATCAGA GCGAGCATAA TCAACCAGAT
TTTCGCTGGA GCAGCCAAGC CTCGGCTTTG CGTTTACCGC AAGCGGCGAC TGGTCAAGCG
CAAACCCTGC GCTTACATCT GAATGGCTTT ACCAACGATT GGCAACCAAC CCGCATCAGC
ATTAGCCTCA ATGGCCAATT GATCGATAGC TATCAACTCA AACCCGATTG GCACTGGCTG
GAAATTGCTT TACCTGCCCA ACCGCAAGGC AGCGATTTGC TGATTGAATT TACTAGTAGC
ACCTTTGTCA GCGGCCCCGA AGATTTAGCA ACGCGGGTCA GTTCGCGAGC CTCCGACCCA
TTGCGCTTAT TGGGCTTTCA GCTTGATCGG GTCGAAATTA AATAG
 
Protein sequence
MMKRYTLHTC IILAVALGLR LATWWLLPYN DWISDEGEYW GAAIWLAQGR EFQFFDSWIW 
TRPPLYISFL AAHIKLFGNS ALWAPRLSQA LISVLSVWLT MRIAKRLTPT EHQARVSLIA
GWLMALGYSF AAFSYFMLSE TLFLSIFLAA NLLLLRWAST RHWRDLLLAG VGFGFAALTR
AIILTWLPLP ALWIAWQIWR TQRPRWQAMI KPVLGFTLSV CVIVLPWTAF ATNRWSNGDG
LILVDTTGGY NFALGAQIAT PDGRNGTRLA EILCGGNGLV CQGSQAARQN QAYAQGFEWL
GENPQRFISK TALELLDILQ VRFDSAEHLT DGYVDGRVPV PHLLGLLLDD TLYVVLVGLA
VLGFWRKQAV AGKGLVLGWL GYNIIVGSLI FAIARFRQPL IPFVIIYAAL AIVQWSQAWA
SSRQQRYAWA SACLLWLIVL PSYLYLPESV GVRSVWQDVR LGFAGVQQAN QCQAIRELLQ
AGDVVAARQL HDRIDAEGRS ENTSVSGYTG RRCLALINGQ LLEAEAKPEQ ALAFYQQANP
KNNPVQSARI LMLEGNLLQR QGQLSAAVAR FNFRDVEIIN DLAWAWDYLT VVPTTTIDLG
SGLDYGYVRG FYQSEHNQPD FRWSSQASAL RLPQAATGQA QTLRLHLNGF TNDWQPTRIS
ISLNGQLIDS YQLKPDWHWL EIALPAQPQG SDLLIEFTSS TFVSGPEDLA TRVSSRASDP
LRLLGFQLDR VEIK