Gene Haur_2231 details

Gene Information

Locus tag: Haur_2231
Symbol:
ID: 5734118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2837196
End bp: 2839532
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 49%
IMG OID: 641279372
Product: helicase superfamily protein
Protein accession: YP_001544999
Protein GI: 159898752
COG category: [R] General function prediction only
COG ID: [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster
TIGRFAM ID: [TIGR03158] CRISPR-associated helicase, Cyano-type
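
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2837196, 2839532)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2337 778
```

Under these assumptions the result agrees with the Gene Length (2337 bp) and Protein Length (778 aa) fields.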


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACAA TTCAATTAAC CGGCCATAAG GAAAAACTCG CGCCGCCGAA TACCATCCCA 
AACTATCGCG CAGCAGCACC ATTGCTCTAT CATCAAGTGC GCACGTACCA AGCACTTGAG
CAAGCGCCCT TGGTAATGAA TACCTATGCA ACTGGCACGG GCAAAACCAC CGCTGCCTCA
TTACGCTTGC TCCACCCTGA TCAACAAAAG CGCAATACAT TAATTATTGC GCCAACCAAC
GCGCTGATTG AGCAGCATTG TGCCGATGCC CAAGCATTTA TCGATGACAA CCAGCTTGCC
ATGCGCACCC TGCCCATAAA TGCTGCGACA ATCCGCGATA TCACCAGCGA AACTCGGCGG
GGCGAAATTT TGCAGCGTTT AATTGCCAAC CCGCTAACCT TTGCTGAGGC ACTAGGGCTG
CCTAACGATG CGGAAAAGCT ACCATTCGTG GCGGTCACCA ACCCCGATAT TTTCTACTTG
GCGCTCTACT TTCGCTATGG CCCCCTCGAT CAACGTAATG TCTGGGAGAA ATTCATCCGT
CAATTTCACT ATATTGTGAT TGATGAATTT CACTATTACG ATAACAAGCA ATTTAGCAAC
TTCTTGTTCT TCTTTAATCT CTGGAAAGAA TGGGGCTACT TTGAGGCTGG CTATAAAATT
TGCCTGCTCT CGGCAACTCC CCGTGACGCA GTTTATCGCT ACTTGAACCA AGTGTTTGGG
GCTGATGGCT GGCAACGAGT CGGTCCCGAC AACGAACCTG AATCCAGCTC CGCTTTGCCG
CCGACCCCAA CTTTAGCCCC ATTAACCATG CATATTCACA ATGACAATCT AGAGGAATGG
CTGCAAACAC AAAGCCAGAT CGTCGCCGAT TGGCAGGCGA ACGCGCTTGA TAGCGTGATT
ATTAGCAGTA GTCTTGGCAA AATCAATCGG ATTCACGCAA CATTACGTCA ACTCAATCCA
ATTCGCATCA CTGGGCCAGA GCCACCCGAA CATCGACGTT TGGTGCGGCC AGTGATTTTG
GCTACACCGA CGGTTGATAT TGGCTACAAC TTTGGTCGGC CCGGCAAGCA ACGTCAAAGT
ATCGATCGGC TGATTTGCGA TGCAAAATTT GGCGACGAAC TAACCCAACG CATCGGGCGG
GCTGGGCGGG TTTTGGGGCG CGAGCAAACC GATATCCCTG GCGAGGCCCA TATTTTTATT
AGCGAAGATG CCTTTGGTGA ATTAAAACAG TGGGATACCC AAACGCTCAA TCGTGCTGAA
TGGGCCAGCA TCATTCACCA ACTTGAACAT CTGCCAGCCA AACATCAACT TGAAGGCTAT
ATTCGCCAAT ATGCCCTGCT CGAAACGTTT TATCCATTGC TCAAATTGCA ACAACTAACC
CCCAAAGACG ACCCTGATCA AGAGGCGTTA TTTAATGCGA TGCGCGATAT TTTCGCTCCC
AATAGCCAAC GCACAATTGG CAGTTTACGC CACTTCTATC GGGTTTATGA GGAGCGCGAA
CAATGGCTCA GGCTCTCGGA AAGTGCCAAA TGGGCCGATC GAGGGGCAGT TGCCAAACAT
TTTGCCGCCC AACTTTCATG GCTTGCCTCG ACCAAAGATC GGCAACAGGA AATCAAGCCA
GAACAAGTAC GCGGTATTCT CGATCAACGC TTGATTGGCC ATAGCACACC GCAAAATGCT
CTGATCGATT TTATCGAATC GCAAGTTGTG ATGACCAAAG CGCTCTTCAA CTTTCGCGAG
GCTTGGCAAG GCCCAAAGGC CGCAATTTAC GACCCCAATC AGTTACTTTC AAGCCAAACA
TTAACCTATT TTGATTTATT TCACGCTTTT TGTCATTTTG ATGTGACAAT CTATCCATCC
AAAACCAAAT TTGAGCAAGA CGCTGGGTCA AGCGAATCAG CCGACGTGTA TTTGCGCATC
AACCAACTCC GCCCAACCCC GCTGGGCCTC GCCTTCGAGC ATCCAAACAC CGATCAACTT
GATCAAACGA CCTTCGATGA TCGCTATTGC AACACGATTA TTGGCATTAA AGGCCTATTG
CTCAGCGCCT ACGAATACGG CTCACGAGCA ACCGTGCCAA TCCCAACTGA AATTCGCAAT
ACCGTTAAAA GCAACGCCAT CCCTTGTCTG ATTGTTGATC AAGCAAGCAC CAATGCATTA
ATTCGGGTAC TTCAGGGCAG CCCAATCTAT CGTCAAGTAT TAAAAGTCGA TTTTGGCGGG
TTATTCGAGG AATATACGAT GGTAACCGGA ACCGCCGCCT TCCATGTGAT TCCTGAACTT
AAACGCCACT TTTTGATGCG CCAAAAACGG GTCAACGATC AGCCAATATT TTTATAA
 
Protein sequence
MITIQLTGHK EKLAPPNTIP NYRAAAPLLY HQVRTYQALE QAPLVMNTYA TGTGKTTAAS 
LRLLHPDQQK RNTLIIAPTN ALIEQHCADA QAFIDDNQLA MRTLPINAAT IRDITSETRR
GEILQRLIAN PLTFAEALGL PNDAEKLPFV AVTNPDIFYL ALYFRYGPLD QRNVWEKFIR
QFHYIVIDEF HYYDNKQFSN FLFFFNLWKE WGYFEAGYKI CLLSATPRDA VYRYLNQVFG
ADGWQRVGPD NEPESSSALP PTPTLAPLTM HIHNDNLEEW LQTQSQIVAD WQANALDSVI
ISSSLGKINR IHATLRQLNP IRITGPEPPE HRRLVRPVIL ATPTVDIGYN FGRPGKQRQS
IDRLICDAKF GDELTQRIGR AGRVLGREQT DIPGEAHIFI SEDAFGELKQ WDTQTLNRAE
WASIIHQLEH LPAKHQLEGY IRQYALLETF YPLLKLQQLT PKDDPDQEAL FNAMRDIFAP
NSQRTIGSLR HFYRVYEERE QWLRLSESAK WADRGAVAKH FAAQLSWLAS TKDRQQEIKP
EQVRGILDQR LIGHSTPQNA LIDFIESQVV MTKALFNFRE AWQGPKAAIY DPNQLLSSQT
LTYFDLFHAF CHFDVTIYPS KTKFEQDAGS SESADVYLRI NQLRPTPLGL AFEHPNTDQL
DQTTFDDRYC NTIIGIKGLL LSAYEYGSRA TVPIPTEIRN TVKSNAIPCL IVDQASTNAL
IRVLQGSPIY RQVLKVDFGG LFEEYTMVTG TAAFHVIPEL KRHFLMRQKR VNDQPIFL
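
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal Python sketch of that translation, not the tool that produced this record. It uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle table 11's alternative start codons (not needed here, since the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; translation table 11
# uses the same amino acid for every internal codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence in this record.
first_block = "ATGATCACAA TTCAATTAAC CGGCCATAAG"
last_line = "AAACGCCACT TTTTGATGCG CCAAAAACGG GTCAACGATC AGCCAATATT TTTATAA"
print(translate(first_block))  # MITIQLTGHK
print(translate(last_line))    # KRHFLMRQKRVNDQPIFL
```

The two outputs match the first ten and last eighteen residues of the protein sequence. The final line can be translated on its own because every preceding line holds 60 bases, a multiple of 3, so it begins in frame.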