Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2800 |
Symbol | |
ID | 5734681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3559699 |
End bp | 3560976 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279943 |
Product | hypothetical protein |
Protein accession | YP_001545566 |
Protein GI | 159899319 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0330256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAGC TTTCGTTGAC TCTTGCTCGT TGGACAACTC GTCGTGTCAT GACCGCCACT TTAGTCGTCC TCGCGATCAT CGGCCTCGCT TTTGTGATTG TTAATTTTTA TAGTGTGTTT GTGGTAGCCT TTATTGCCTT TGTGCTCAGC ACCGCGATTC GCCCATTGGT GCAATTATTA CAACGGCTCA AAATCTCACC CCAACTAGGG GTAATTATTG CCTACCTGCT ATTATTGGCA CTGCTGGTGG GCATTGTTGT GCTGATGGCT CCGCTGATCA CCGAACAAAT TACCGCAATT ACTGCCAAAA TTCCTGAGTA TTATCATGAT GCCCGCCAAT TTTTAATTTC TTCGCGCAGT AGTTTTATTC GCAATTTAGC GTTACGTTTG CCACTTGATG CCCCAATTTC GCTATCTGGA GTTGCGCCCA GCGAAACCAG CCAGCAACAA ACTGACGTAG CAGTAAGCCA ACTTGTCACT GTTGTAGAAA ATGCGGGCAT CAGTTTGTTT GTGTTGATTG CCACCTTACT GCTGGGGTTT TATTGGACGC TTGATGGTGA TCGGGTACTG AGAACCTTGC TCCAGCTCGT CGCGGCAGAA AAACGTGAGA ATTGGCGTTC GCTGATTGCC GAAATTCAGG CCAAAATGGG CGCATTTATT CGCGGACAAC TCATCCTTGA TCTCTCGATT GGGGCACTTT CAACCGCCGC CTACTTGCTA ATTGGTATCG ATTATGCGAT TGTACTGGGC ATTTTGGCTG GTTTACTCGA AACCATCCCG ATTTTGGGGC CAGTGCTTGG GGCAGTGCCA CCCTTGCTGA TTACCCTGGC CCAAGGTGAC ACAACCGCCT TTATTTGGGT GATTGTAGCG ACCGTGGTGA TTCAACAAAT TGAAGGCACT TTTTTAGTGC CCAAAGTGAT GGATCGGGCG GTTGGGGTGA ATGCGGTGCT GACCTTGGTC GCTTTTGCGG CCTTTAGTGC GACCTTAGGC TTGGCTGGGG GGATTTTGGC CGTGCCATTG GCAGCCATTG TGCAAATTAT CTTCACTCGC TTGGTGTTTA ATCAAGCTGA AACCAGCACC AACGTCACTC GCCGTGATCG CTTTGGCGTG CTGCATTATG AATCACAGCA ATTGCTCCAA TCCTTGCAAC GCCATAATCG GGCTGATGAA ACTGAAGATG AAGCTAATAT TGTGTTTGAC GATCAGCTCG AACATGTGGT CGTCGATTTG GATGGCATGT TGGCGGCGAT CAATAGCAGC GAGGAAGTAG CGGCATGA
|
Protein sequence | MDELSLTLAR WTTRRVMTAT LVVLAIIGLA FVIVNFYSVF VVAFIAFVLS TAIRPLVQLL QRLKISPQLG VIIAYLLLLA LLVGIVVLMA PLITEQITAI TAKIPEYYHD ARQFLISSRS SFIRNLALRL PLDAPISLSG VAPSETSQQQ TDVAVSQLVT VVENAGISLF VLIATLLLGF YWTLDGDRVL RTLLQLVAAE KRENWRSLIA EIQAKMGAFI RGQLILDLSI GALSTAAYLL IGIDYAIVLG ILAGLLETIP ILGPVLGAVP PLLITLAQGD TTAFIWVIVA TVVIQQIEGT FLVPKVMDRA VGVNAVLTLV AFAAFSATLG LAGGILAVPL AAIVQIIFTR LVFNQAETST NVTRRDRFGV LHYESQQLLQ SLQRHNRADE TEDEANIVFD DQLEHVVVDL DGMLAAINSS EEVAA
|
| |