Gene Haur_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3361 
Symbol 
ID5736903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4238588 
End bp4240006 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content51% 
IMG OID641280508 
Producthypothetical protein 
Protein accessionYP_001546125 
Protein GI159899878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000455234 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTTG CGAAGCTGCG TGTACGGGCA ATGATGGTTG CCTTGTTGCT TTCAATCTTT 
GCAGTAGGCG GTCGGGGCGT ATCAGCTCAA ACCAATGCCT ATGCAACCGC ATTCACCACC
TCGATCACCT ACCAAAACAT TGGTACTGGT ACTGCTAACA TCAACTTGAC CGTGTACAGC
TCAAGTGGAA CTCCATCAGC CATCCCAGCC TCAACCTTGG CTGCTAATGG CGCTGGTGCC
TACTTTGTTG GCTCAGTCAG TGGCTTGGGC ACCACCTTCA ATGGTTCAGC AGTGATCTCA
GCAGATCAAC CAATTGCTGC AACCTTGGTG CAAATCCCAG CCGCAGCTTC ACAAGTCAAG
AACCGCCCAT TGTCAAATGG TTTCTCAAGC GGCTCAGACA CCGTGTTGAT TCCAACGGTT
TTGAAGGCAT CATCAAACTA CACCACCAAG TTTGTGATTC AAAACACCGA CTCAGTGGCA
AATGACTTCA CCGTTCAATT CATCAATCCA GCAACTGGGG CAGTTGTTCA CACTGCTAAC
CCAACTAACG TCTTGCCAAA CACCTCAGTC TACTACGATG CTGGCACGAT TTCAGCCTTG
GGTGCAAGCT TCAGCGGCTC AGTCAAAGTA ACGGCTGTCA AGAATGGCAC CAGCAACCCT
GGTAGCGCCG TTGGTACCGC CCTTGAATTG CAAACCAATG GTGTTGGTGC TTATGCTTCA
CAAGCATTCC CATCAACTGC TGCTGCAACC AAAGTTTCGA TGGCAACTGC CCTCTGTAGC
TATGTGATTC CAAGTGGTCA AACCACCTCG TTCTATGCAG TCCAAAACGC TGGTACTTCA
TCAGCAAGCG TGACTGTAAC CTACGTTGGT ACCGCTGCTG GTTCACCAGT CAACGTTACA
AGCACCGCAG TCAACATTGC TGCTGGCGCT AAAGCTAGCT TCAATCCTTG TGGTACCACT
CCAACCAACT TCACTGGCTC AGCAACCATC AACTCAACCC AACCAATCTT GGCTGTTGGT
AAAGTTAATG GTGGTGGCTT GTACACCGCA TTCGAAGGTG CAACCGCTGG TAGCGCCAAG
ACTGCATTGC CATACGTTCG CTGGTTGACC CCAGCTCAAG GTGGCCAACA AACCTACATC
GCTATCCAAA ACGTTGGCAC GAGCGCAGCA AGCAGCGTAA CCGTCAAGTA CTATAGCGGT
GCTGGTGCAT TGCTCGGTAC TCACACCATC CCAAGCATCG CTGCTGGCGC TAAAGCTAGC
TCAAACCCAA CCAACGCTGG CGTAACCAAT ATGGGTGTTG GTGGTGGTTC AGCCGTGGTT
GAAGGCGCTG GCGCTCAATT GATTGTGGTT GCCCGCGTAA CTTCACCTGT TGGTACTGGT
ACCACCGGCG AAGACTACAA CGGTATTCCT TTCAACTAG
 
Protein sequence
MTFAKLRVRA MMVALLLSIF AVGGRGVSAQ TNAYATAFTT SITYQNIGTG TANINLTVYS 
SSGTPSAIPA STLAANGAGA YFVGSVSGLG TTFNGSAVIS ADQPIAATLV QIPAAASQVK
NRPLSNGFSS GSDTVLIPTV LKASSNYTTK FVIQNTDSVA NDFTVQFINP ATGAVVHTAN
PTNVLPNTSV YYDAGTISAL GASFSGSVKV TAVKNGTSNP GSAVGTALEL QTNGVGAYAS
QAFPSTAAAT KVSMATALCS YVIPSGQTTS FYAVQNAGTS SASVTVTYVG TAAGSPVNVT
STAVNIAAGA KASFNPCGTT PTNFTGSATI NSTQPILAVG KVNGGGLYTA FEGATAGSAK
TALPYVRWLT PAQGGQQTYI AIQNVGTSAA SSVTVKYYSG AGALLGTHTI PSIAAGAKAS
SNPTNAGVTN MGVGGGSAVV EGAGAQLIVV ARVTSPVGTG TTGEDYNGIP FN