Gene Haur_0618 details

Gene Information

Locus tag: Haur_0618
Symbol:
ID: 5732516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 713594
End bp: 715330
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 51%
IMG OID: 641277745
Product: fibronectin-binding A domain-containing protein
Protein accession: YP_001543394
Protein GI: 159897147
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
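
The length fields above are consistent with the coordinates. The short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 713594, End bp 715330.
length = gene_length_bp(713594, 715330)
print(length, protein_length_aa(length))  # 1737 578
```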


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACATTG ATGCGCTAGT TGTTGCAGCG ATTGTCGCTG AATTACAGAT CTTGGTTGGC 
GGCAAAATTC AACAGGTGGT TTTGCCAAAT CCTGATAGTG TAGGTTTTGA AGTCTATGCT
GATGGCCAGC GCCATCAGTT ATTGCTTTCG GCCAATCCTA AATTTGCCCG CATGCATACT
ACCCCAACCA AGCTAACTCG TGATCCGAAT GCCGATTCGC CTTTGTTGCT ATTATTGCGT
AAATATGTTC GTGGTGGACG GATTACCAAA ATCGAATCTG CGCCATTGGA ACGGGTTATT
TCGCTCAGTA TAGCCAAGAT GCCGATTCCT CGCAAGGAAC TTGAGCCTGA CGATGATGAC
GACGATGAGG TGATGCTTAC GCCGCGTTAT AGTGAGTTGG TACTAGAAAT TATCGGTCAT
TCATCAAATA TTATTTTGGT CGATGATAAT GGCTTGGTCT TGGAAAGTAT TCGTCACTAT
AACCCGCAAC GTTCGCAACG CCCAATCATG CCACGTGGCA TGTACGAAGC GCCGCCCAGC
CAAGGCAAAT CTGACCCGCT CCAAGCAACC GCTGAACAAA TTGCGGCCTT AGGTGGCGAT
TTGGCCAAAG CCTTGGTGAC CGAATATAGT GGCATCTCGC CGCAAACTGG CCGTGAAATT
GCTTGGCGGG CAGTCGGCGC AACCAGCGTC GAAATTACGC CAGAACTAGA TTTTGCACAG
ATTGCCCAGC TTTTGCGCCA ACTTACCAGC CTTAGCAGGA GCGAGCCAAC CCTTGCCCGC
AATGCTGATG GCACGCCAAT TGGGATTGCT GCCTTTAATT TGCAGCACCA AGCGCATACT
GAAACCTTCC CCAGCATGAA CGAGGCCTTG GCAACCGCCT TTGCTGAGCT TGATCAGGTG
ACAGCGCACG CTCAACGGCG TGAAGCCTTG CTCGAACGGG TGGCTGAAGC TCAGCGCCGC
ATCAAAACCA AAGCTGATCA ATTACGCACT CAGTTGGCGC GGGTTGAGCA ACTTGAGCGT
TTGCGCTGGG AAGGTGAGAT GATTTTTGGC TATATTTATG CGATCAAACC TGGACAAAGC
GAATTGCTGC TTGACCAAGG CGTGATCACG CTTGATCCAA CATTATCGGC GGTCGAGAAT
GCTCAAGCAA AATTTCGCGA GTACGACAAA GCCAAAGGTG CATTAGAGGA TGTGCCACAA
CTCTTGGAGC AAACCGAGGC TCAAGCCGAA TATTTGCAAC AAACCAACGA TCTGTTGAGT
TTGGCCGAGA GTTTTGCTGA AATTGAGCAA TTTGAGCGTG AGTTGATTGC TGGTGGCTGG
CTGCGCCAAA CGATTGGCAA AGCCAAAAGC AAGCCCAATT CTAGCGTTGG GCGTGGCCCG
TTGCGGGTAA TTTCGCCCGA TGGCTGGACA ATTTTCGTTG GTCGTACCGC TGACCAAAAC
GATGAAGTAA CCTTCAAACT TGGTCAGCCC GAGGATTATT GGTTGCATGC CCGTGAACGA
ACTGGCGGCC ATGTGATTAT TCGTATGCAA TCGGCGAATG TGCCGCCGCG TACCCTTGAG
CAAGCGGCGC AACTGGCGGC CTACTATTCA TCGGCTCGCA ACGATGGCGC AGTTGAAGTC
GATATTGCCT TACGCAAACA TGTGCGCAAA ATCAAAGGCG GCCCACCTGG TTTAGTGCGC
TATACCGCTG AGCAAACCCT ACGCGTCGCA CCCCAAAAAG AACCGAAGAG AACATAG
 
Protein sequence
MHIDALVVAA IVAELQILVG GKIQQVVLPN PDSVGFEVYA DGQRHQLLLS ANPKFARMHT 
TPTKLTRDPN ADSPLLLLLR KYVRGGRITK IESAPLERVI SLSIAKMPIP RKELEPDDDD
DDEVMLTPRY SELVLEIIGH SSNIILVDDN GLVLESIRHY NPQRSQRPIM PRGMYEAPPS
QGKSDPLQAT AEQIAALGGD LAKALVTEYS GISPQTGREI AWRAVGATSV EITPELDFAQ
IAQLLRQLTS LSRSEPTLAR NADGTPIGIA AFNLQHQAHT ETFPSMNEAL ATAFAELDQV
TAHAQRREAL LERVAEAQRR IKTKADQLRT QLARVEQLER LRWEGEMIFG YIYAIKPGQS
ELLLDQGVIT LDPTLSAVEN AQAKFREYDK AKGALEDVPQ LLEQTEAQAE YLQQTNDLLS
LAESFAEIEQ FERELIAGGW LRQTIGKAKS KPNSSVGRGP LRVISPDGWT IFVGRTADQN
DEVTFKLGQP EDYWLHARER TGGHVIIRMQ SANVPPRTLE QAAQLAAYYS SARNDGAVEV
DIALRKHVRK IKGGPPGLVR YTAEQTLRVA PQKEPKRT
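
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the amino acids in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as printed in the record.
first_row = "ATGCACATTG ATGCGCTAGT TGTTGCAGCG ATTGTCGCTG AATTACAGAT CTTGGTTGGC"
print(translate(first_row))  # MHIDALVVAAIVAELQILVG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) would check the whole record; `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.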