Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4225 |
Symbol | |
ID | 5736079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5385021 |
End bp | 5387225 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281380 |
Product | hypothetical protein |
Protein accession | YP_001546985 |
Protein GI | 159900738 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0726611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAC GATACACGCT GCATACCTGC ATCATTTTGG CGGTTGCTTT GGGCCTGCGC CTCGCCACAT GGTGGCTTTT GCCCTATAAC GATTGGATTA GTGACGAAGG CGAATATTGG GGTGCTGCTA TCTGGCTAGC GCAAGGCCGC GAATTTCAGT TTTTCGATAG CTGGATCTGG ACACGCCCGC CGCTGTATAT TAGTTTTTTG GCGGCGCATA TCAAGCTTTT TGGCAATAGC GCGTTGTGGG CTCCCCGCCT GAGCCAAGCC TTGATCAGCG TTTTGAGCGT TTGGCTAACC ATGCGGATTG CCAAGCGGCT CACTCCAACT GAACACCAAG CACGGGTTAG CTTAATTGCT GGCTGGCTGA TGGCGCTGGG CTATTCATTT GCTGCATTCA GCTATTTTAT GCTTTCAGAA ACGTTGTTTT TGAGCATCTT CTTGGCGGCG AATTTATTGT TGTTGCGCTG GGCTAGCACC CGCCATTGGC GCGATTTGCT CTTGGCTGGG GTAGGCTTTG GCTTTGCTGC CCTGACCCGA GCGATCATTT TAACATGGCT GCCGTTGCCC GCCTTGTGGA TTGCTTGGCA AATTTGGCGC ACTCAGCGCC CACGCTGGCA AGCCATGATT AAACCAGTGC TTGGCTTTAC GCTCAGCGTT TGTGTAATTG TCTTGCCGTG GACAGCCTTT GCAACCAATC GTTGGAGCAA CGGCGATGGC TTAATTTTGG TCGATACGAC TGGTGGGTAT AATTTTGCCC TCGGTGCGCA AATTGCCACG CCTGATGGCC GTAATGGCAC CCGCCTAGCC GAGATTTTGT GTGGTGGCAA TGGCTTAGTT TGCCAAGGCT CGCAGGCAGC ACGGCAAAAT CAAGCGTATG CCCAAGGGTT TGAGTGGCTA GGCGAAAATC CACAGCGCTT TATCAGCAAA ACGGCGCTTG AATTATTAGA TATTTTACAA GTGCGCTTTG ATAGCGCTGA ACATTTGACC GATGGCTATG TTGATGGCCG CGTGCCAGTG CCCCATCTCT TGGGCTTGTT GCTCGACGAC ACGCTGTATG TAGTGCTTGT AGGCTTGGCG GTGCTTGGGT TTTGGCGCAA GCAAGCGGTT GCTGGCAAAG GCTTGGTGCT TGGTTGGTTG GGCTATAACA TTATTGTTGG CTCGTTGATT TTTGCAATTG CCCGTTTTCG TCAACCGCTG ATTCCCTTCG TGATCATCTA TGCCGCCTTG GCAATCGTGC AGTGGTCGCA AGCTTGGGCC AGCAGCCGTC AGCAGCGTTA TGCTTGGGCT AGTGCTTGCT TGTTGTGGCT GATTGTGTTG CCATCGTATC TGTATTTACC AGAATCGGTC GGGGTACGCA GCGTTTGGCA AGATGTGCGT TTGGGCTTTG CTGGCGTGCA ACAAGCCAAC CAATGCCAGG CAATTCGTGA GCTATTGCAA GCTGGCGATG TAGTGGCAGC TCGCCAACTG CACGACCGCA TTGATGCCGA GGGTCGCAGC GAAAATACTA GCGTTAGTGG CTATACTGGG CGGCGCTGCT TGGCCTTGAT CAACGGCCAA TTGCTCGAAG CCGAAGCTAA ACCAGAGCAA GCCTTGGCCT TCTACCAACA GGCTAATCCC AAAAATAATC CGGTGCAATC GGCCCGCATT TTGATGCTCG AAGGCAATTT GCTGCAACGC CAAGGCCAAC TTAGCGCAGC AGTCGCCCGT TTTAATTTCC GCGATGTCGA AATTATTAAT GATCTGGCTT GGGCGTGGGA TTATTTGACC GTTGTGCCAA CCACCACCAT CGATCTTGGC TCTGGCTTGG ATTATGGCTA TGTGCGCGGA TTTTATCAGA GCGAGCATAA TCAACCAGAT TTTCGCTGGA GCAGCCAAGC CTCGGCTTTG CGTTTACCGC AAGCGGCGAC TGGTCAAGCG CAAACCCTGC GCTTACATCT GAATGGCTTT ACCAACGATT GGCAACCAAC CCGCATCAGC ATTAGCCTCA ATGGCCAATT GATCGATAGC TATCAACTCA AACCCGATTG GCACTGGCTG GAAATTGCTT TACCTGCCCA ACCGCAAGGC AGCGATTTGC TGATTGAATT TACTAGTAGC ACCTTTGTCA GCGGCCCCGA AGATTTAGCA ACGCGGGTCA GTTCGCGAGC CTCCGACCCA TTGCGCTTAT TGGGCTTTCA GCTTGATCGG GTCGAAATTA AATAG
|
Protein sequence | MMKRYTLHTC IILAVALGLR LATWWLLPYN DWISDEGEYW GAAIWLAQGR EFQFFDSWIW TRPPLYISFL AAHIKLFGNS ALWAPRLSQA LISVLSVWLT MRIAKRLTPT EHQARVSLIA GWLMALGYSF AAFSYFMLSE TLFLSIFLAA NLLLLRWAST RHWRDLLLAG VGFGFAALTR AIILTWLPLP ALWIAWQIWR TQRPRWQAMI KPVLGFTLSV CVIVLPWTAF ATNRWSNGDG LILVDTTGGY NFALGAQIAT PDGRNGTRLA EILCGGNGLV CQGSQAARQN QAYAQGFEWL GENPQRFISK TALELLDILQ VRFDSAEHLT DGYVDGRVPV PHLLGLLLDD TLYVVLVGLA VLGFWRKQAV AGKGLVLGWL GYNIIVGSLI FAIARFRQPL IPFVIIYAAL AIVQWSQAWA SSRQQRYAWA SACLLWLIVL PSYLYLPESV GVRSVWQDVR LGFAGVQQAN QCQAIRELLQ AGDVVAARQL HDRIDAEGRS ENTSVSGYTG RRCLALINGQ LLEAEAKPEQ ALAFYQQANP KNNPVQSARI LMLEGNLLQR QGQLSAAVAR FNFRDVEIIN DLAWAWDYLT VVPTTTIDLG SGLDYGYVRG FYQSEHNQPD FRWSSQASAL RLPQAATGQA QTLRLHLNGF TNDWQPTRIS ISLNGQLIDS YQLKPDWHWL EIALPAQPQG SDLLIEFTSS TFVSGPEDLA TRVSSRASDP LRLLGFQLDR VEIK
|
| |