Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3651 |
Symbol | |
ID | 5735512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4590889 |
End bp | 4592814 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280800 |
Product | UvrD/REP helicase |
Protein accession | YP_001546415 |
Protein GI | 159900168 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00677749 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGATA CACTCAATAG CGAACAACGG GCGGCGGTAA TGGCGCAGCT TGGGCCAGTT TTGGTCAAGG CTGGCGCGGG CAGCGGTAAA ACTCGCGTGC TGACCTACCG CATCGCCTAT TTAATCGAGC AGGGCGCTAG CTCCGATTCG ATTGTTTCGG TGACCTTCAC CAACAAAGCC GCCAGCGAAC TGCGTACCCG CTTGCGCGAT CTTTTGGGCA AGCGTAGCCG TGGCCTGACT GCCGGAACCT TCCACGCGAT TTGTGGCAAA TTGCTGCGCC AACATATTAA TGGACGAATT CGCAATTACA CTGCCAACTT TACAATTTAC GCTGGCGATG AGCAATTGCA ACTGGTGCAA CAAGCCATGG ATGGCTATAC CGGACGCATG CCGCACGATC TTGAAGCACC GCAAGTGCTC AATTTGATTT CGCGCTTCAA AAGCCGCATG CAACCACCAA CCTTAGCTCG GCAGATGGCT AGTGATCCGG TCAGTCAATA TGCAGCAGCG ATTTATCGCA CCTATCAACG TCAGCTTGAG CGCTCGAATG CTGTTGATTT CGATGATATG ATCGTGATGA CCTACAAGCT GTTGTTTGAG CATCACGATG TGCTCGATGA AGTTCAATCG CGCTGGGCGC ATGTGTTGGT CGATGAATAT CAAGATACCG ATTCGGCCCA ATATGCCTTG CTCGAGTTAC TTTCACGGCC AGTTGCTCAG CGCCCACGCT CATTGTTTGC GGTTGGCGAT GCCCAACAAT CGATCTATGG TTTTCGCAAC GCCGATTACA CGATCATCAA TCGCTTCACC CGCGATTTCC CTGAAGCCCA AGTGGTTGAG TTGTTGACCA ATTATCGTTC GCGCCAAGAG ATTCTTGATG CGGCCTATGC CGTGATGCGC CATTCGGTCG CTGTGCCAGC CTTGACCTTG AAAGCTGCGC GTCGTTCGCC GCCAGTGCCA GCCTTGGTGA TCAACGAGGC TACTGATGAT CGCGCCGAAG CCGATGCGAT TGCCAAATCA ATTGGTAATT TGCTGAGCAC AGGTCGTCGC AGCAAAGATA TTGCGATTTT GTATCGTTCG CGGCATATGA GCCGTGGCCT AGAAACGGCG CTGCGCCAAG CACGGATTCC CTATTCGCTC AAGGGTCAAG CGGGGTTTTA CGATCGGCGG GTGATTCGCG ATGCTTTGGC CTTTTTGCGG ATTATTGCTA ATCCCAGCGA TAGCCTGAGC ATGAATCGGA TTATCAATGT GCCAGCGCGT GGCATTGGCG CTAGCACAAT CGCCCATCTC ACCAGCAAAG CTGCTGCACT GGAACTACCG CTCGGTGAAG CCTTGTTGAA GCGTGAAGCA TGGGAAGGCT TGAGTGATCG AGCCAGCCAA ATGGTCAGCG ATTGGGCACG CAAAGTCTAT CGTTGGCGCA AATTGGCTAG CGGCAGCTAC CCACCCGAGG GCTTGTTGCA AACGGTGCTG CAAGAAAGTG GCTATCAATC CATGATCGAA AAAGACTGGA ACGACCCTGA GCGCAGTGAT GCCCTGGCCC ACCTTGAAGA ATTACGGGTT GCTGCTGGCG AACATACCAG CCTCGCAGCC TTTTTGCAGG AAATTGCCCT GCTGACCAAC GTTGAAGACA AAGATGAGCG TGATGCTGTT CAATTGCTGA CCATCCACGG AGCCAAAGGC TTGGAATGGC CAATTGTCTA CGTGGCGGGC TTGGAAGAAG GTACATTGCC CCACGAACGT TCGTTGGTCG AAGCAGGCGG GGTGGAAGAA GAGCGGCGTT TGTGCTATGT GGCCTTGACC CGCGCAGGTG AACAACTCTA TCTTTCGCAT GCCAAAAAAC GCCAACGCAA CCAACGTAAT CCATCGCGCT TCCTTGATGA TATTTTGGTT TATGGCCGTG AACGCGCCAA AGCCAAGGCT TTCTAA
|
Protein sequence | MFDTLNSEQR AAVMAQLGPV LVKAGAGSGK TRVLTYRIAY LIEQGASSDS IVSVTFTNKA ASELRTRLRD LLGKRSRGLT AGTFHAICGK LLRQHINGRI RNYTANFTIY AGDEQLQLVQ QAMDGYTGRM PHDLEAPQVL NLISRFKSRM QPPTLARQMA SDPVSQYAAA IYRTYQRQLE RSNAVDFDDM IVMTYKLLFE HHDVLDEVQS RWAHVLVDEY QDTDSAQYAL LELLSRPVAQ RPRSLFAVGD AQQSIYGFRN ADYTIINRFT RDFPEAQVVE LLTNYRSRQE ILDAAYAVMR HSVAVPALTL KAARRSPPVP ALVINEATDD RAEADAIAKS IGNLLSTGRR SKDIAILYRS RHMSRGLETA LRQARIPYSL KGQAGFYDRR VIRDALAFLR IIANPSDSLS MNRIINVPAR GIGASTIAHL TSKAAALELP LGEALLKREA WEGLSDRASQ MVSDWARKVY RWRKLASGSY PPEGLLQTVL QESGYQSMIE KDWNDPERSD ALAHLEELRV AAGEHTSLAA FLQEIALLTN VEDKDERDAV QLLTIHGAKG LEWPIVYVAG LEEGTLPHER SLVEAGGVEE ERRLCYVALT RAGEQLYLSH AKKRQRNQRN PSRFLDDILV YGRERAKAKA F
|
| |