Gene Haur_3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3336 
Symbol 
ID5735206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4205405 
End bp4206484 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content53% 
IMG OID641280483 
ProductTrkA domain-containing protein 
Protein accessionYP_001546100 
Protein GI159899853 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0569] K+ transport systems, NAD-binding component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.819763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAG CCAAAACCCC TAGTCGCAAA GCGCGTTGGC GAAGACTGAT CGCGGCCTCG 
TTGCGCGATG GTTGGATTGT GTTTCGCGAT TCGGGAATTT GGCTGGGTTT ATGGCTTTTG
TTGTGGTTGG GGTTTACCCT GGCAATTTGG GCAGGCACGC GCCCAGTTTT AGCTTTTAAC
CAAGCGCTCT ATCAAGCATT TAGCCAAATG ACCCTCAACC CCGTGCCACT GCCTGAGCCA
TGGTGGCTGC AAGTATTGTT TTATCTCGCT CCAGCCTTGA ATATTATTTT GCTGGCGCGA
GGTGCGCTCA ATATGGGTAT TTTGCTGTTC GATAAACGCA ATCGGCGGGA GGCTTGGCAA
ATGGCGTTAG CATCAACCTA TCGTGATCAT ATTATTGTGT GTGGTTTGGG CAAAATTGGC
TATCGGGTGG TTGGGCAATT GCTGGCCAGC GGCTGCGATG TGGTGGTAAT TGATGCTCAC
AACGATGGGC CGTTTCACGA ATTGGTGATG GGTCAGCATG TGCCAGTGAT TATTGGCGAT
GCTCGTCAGC CTGAGCTGCT GCACGAAGCA GGTTTGCGCC ATGCCACCTC GCTCACTGTG
GTTACTGGCG ACGATTTGAC CAACCTCGAT ATTGCCTTGA CCGCCCGCGA ATTGCATCCT
GATATTCATA TTGTGATGCG GGTTTTTAAC GATTCATTGG CGAGCAAACT TAGCTCAGCC
TTTCATATTC AGACCGCATT TAGCACCTCG GCGCTGGCCG CCCCAACTCT TGCCGCTGCG
GCCTTGGGTC GCGGCATTAC CAACGCCTTG TATGTGGCAG GCAAGTTGCT CTCGACGGTG
GAAATTACGG TAGCCCGCGA TGGCATTTTT GATGGACGCT TGATCCAGAC GGTTGAAAAT
CAGCACGATA TTTCGGTGCT TTATCGGCGT GGGCGCAATG GCGAAGATTT ACGCCCACGC
GGCGATGAAC GTTTGAGCAG CGGCGATCAA TTGGTGATTA TCGGGCCGTT GGCAGCGATT
AATCAGATTC AAACGCTGAA TAAACCGAAT GCCGCGCCGC ATGCGCCCTA TCGACTTTGA
 
Protein sequence
MKPAKTPSRK ARWRRLIAAS LRDGWIVFRD SGIWLGLWLL LWLGFTLAIW AGTRPVLAFN 
QALYQAFSQM TLNPVPLPEP WWLQVLFYLA PALNIILLAR GALNMGILLF DKRNRREAWQ
MALASTYRDH IIVCGLGKIG YRVVGQLLAS GCDVVVIDAH NDGPFHELVM GQHVPVIIGD
ARQPELLHEA GLRHATSLTV VTGDDLTNLD IALTARELHP DIHIVMRVFN DSLASKLSSA
FHIQTAFSTS ALAAPTLAAA ALGRGITNAL YVAGKLLSTV EITVARDGIF DGRLIQTVEN
QHDISVLYRR GRNGEDLRPR GDERLSSGDQ LVIIGPLAAI NQIQTLNKPN AAPHAPYRL