Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0184 |
Symbol | |
ID | 5732093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 213199 |
End bp | 214629 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277308 |
Product | DNA repair protein RadA |
Protein accession | YP_001542964 |
Protein GI | 159896717 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00274159 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAC AACGTACAAT TTTCGTCTGC CAACAATGTG ATGCGCAGTT TCCCCGTTGG ATGGGTCGTT GTACCGAATG TGGTTCGTGG GATAGCTTGG TTGAACAGGT TATCGCAAAA TCAGCAGGTA CGACAGGCGG TAGTACTCGT TCACCTATTG GAGTGAGCGA ACCATTACGC CTGCCCGATA TTCAACTTGG CGATGTTCAA CGCTTGCCAG TTCGTGGCAG CGAATTTGCT CGCGTGCTTG GTGGCGGGAT TGTGCCAGGC TCGTTGGTGC TGATTGGCGG CGATCCTGGG ATTGGTAAAT CGACGCTCTT GCTCGAACAA AGTGCTGCTT TGGCGGAAAC TGCTGGCGAT GTGTTGTATA TCTCAGCGGA AGAATCGCAG CAGCAGATTA AGCTCCGCGC CACCCGTTTG GGCTTATCGG CGCAGCGTTT ATATATTTTG GCCGAAACCA GTCTTGATAC GGCGATTGCC ACAATTGAAC GCATGAAGCC AGTTTTGGTG ATCGTCGATT CGATTCAGAC CGTGTATAGT GAGAGTGTGA CCTCGGCGGC AGGCAGTGTT TCGCAGGTGC GCGAAGGTGC GCTACGGCTT CAGCGAGTTG CAAAACAACA CAATATTTCA ATTATGCTGG TTGGCCATGT GACCAAAGAA GGCGCGATTG CTGGGCCACG GGTGCTTGAG CATATTGTTG ATGTGGTGTT GTACCTTGAG GGCGAACGGT TTCATCAATA TCGCTTATTG CGCAGCGTCA AAAATCGCTT TGGCTCAACC AATGAAGTTG GGGTTTTCGA GATGAATCAA GGTGGCATGG TCGAGGTGAC CAATCCATCG CAGATTTTCT TGGCCGAGCG TAGCACCAAC TCGCCTGGCT CAGCGGTAGC AGTGATGTTA GAAGGCACGC GACCCTTGTT ATTGGAAGTG CAAGCCCTGA CCAGCCATAC TGCCAATGCC CAACCGCGAC GAACGGCCAA CGGCTTTGAT CAAAATCGCT TGGCGATGAT CATTGCGGTG CTCTCAAAAC GGGTTGGTGT GCCATTGTTC AATCAGGATA TTTATGTTAA TGTGGTAGGC GGTTTGAAAG TGACCGAGCC AGCGATCGAT CTAGCGGTGG CTACGGCAAT AACTTCATCA TTTCGCAACC AGCGGGTTGA ACCAAATACC GTTTTAATTG GTGAAATTGG GCTTTCGGGC GAGTTGCGTT CGGTAAGTCA GCTTGATCGA CGTTTGAATG AAGCCGCCAA GCTGGGTTTT CATAATGCAA TCGTGCCGCA GATCGATGCC TTACCGCAGA TCGATGCGTT TCAGGTTACT GGCATTCGTT CATTGATTGA AGCAGTGCGA GCTGCTTTAA TTGGTGCGCC GCGACCATCA CCTGGTGAGC CGCCAACGGC AAAGGCAACA ACTGATACTA ATGATGATTA A
|
Protein sequence | MAKQRTIFVC QQCDAQFPRW MGRCTECGSW DSLVEQVIAK SAGTTGGSTR SPIGVSEPLR LPDIQLGDVQ RLPVRGSEFA RVLGGGIVPG SLVLIGGDPG IGKSTLLLEQ SAALAETAGD VLYISAEESQ QQIKLRATRL GLSAQRLYIL AETSLDTAIA TIERMKPVLV IVDSIQTVYS ESVTSAAGSV SQVREGALRL QRVAKQHNIS IMLVGHVTKE GAIAGPRVLE HIVDVVLYLE GERFHQYRLL RSVKNRFGST NEVGVFEMNQ GGMVEVTNPS QIFLAERSTN SPGSAVAVML EGTRPLLLEV QALTSHTANA QPRRTANGFD QNRLAMIIAV LSKRVGVPLF NQDIYVNVVG GLKVTEPAID LAVATAITSS FRNQRVEPNT VLIGEIGLSG ELRSVSQLDR RLNEAAKLGF HNAIVPQIDA LPQIDAFQVT GIRSLIEAVR AALIGAPRPS PGEPPTAKAT TDTNDD
|
| |