Gene Haur_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3221 
Symbol 
ID5735089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4075136 
End bp4076680 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content52% 
IMG OID641280367 
Producthypothetical protein 
Protein accessionYP_001545986 
Protein GI159899739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAC GAATTAATGT CGGTATGAAT CCAACCGTAG CGGTCCGCAT TTGTGAAGGT 
AGTTTGCGGG TAATTGCCCA CGATTTGCCC GAAGCTTGGT TTGATGTTGA TGAAGATGAA
ATTAGTTTTC GCGCCCAAGA AGGCCATCTC TCAATCGAGC GTTGCTCCGA TGATTTGGAA
TTACGCTTGC CCCATGGCGC AGTTTTAACC ATTGATGTGG TTCAAGGCGA TGTTGATCTG
ACTGGTTTGA GCGCTGTGCA TACCCGCCAA ATTGAAGGCG ATATTAGCGC CCGTGATGTG
CAAACCTTTG AAAGTGATTC GGTGCACGGC GATGCTAGCT TTACTAAAAG CGCCAAACTT
AGCCTTGCCA ACATTGATGG TGATCTCAAA ATCTATGAGA ACACTGATAC TGTGGTGATT
AAAAATGTCA ATGGCGATGC CAAAATTAGC CAAGCCCACA ATTTGACGAT TATCAATGTG
AATGGCGATT TGAGCGCAAG CGATCTGTTT GGCGATGTAG CGATTACCCG CGTCAGTGGT
GATGCTACGT TGCGTGGCGC GATCAAGAGC CTTGCGCCAA TCCATGTTGA TGGCGATTTG
AAATTGGGCA TCAATTGGCT ACCTGAGCAA GTCTATCGTG CTAGTGCCAA TGGCGATGTT
GTGTTAGAAG TTGCCGATGA TGCTAATTTG AGCGTCAATG GTTTTGTCCA AGGCGATGTC
TCAGGCATGG GCGATCGTGA GCCTGGCTCG ATTAGCCTGA CTTGGGGTAC GGGTACAGCT
CGCTTAGAAT TGAATGTGCA AGGCGATTTG AGCATTCGGG GCGGAGCTGC TAGCCATAGC
CACTCCAAGA GTGTTGGTGG CACAAGCTGG AACACCAACT GGAATTGGAA TAACGATGAT
TTTAACCGCG CTATTCGCGA TTTCACCGAT GATTTAGCCT CGATGGGCCG CGATATTGCC
GCTCAATTCC GTGAAATGAG CCGCGATTGG CGTGATGGCA AGGGCGAACG TACCGCTGAA
CGTGCTCGCC AAGCCACTGA ACGCGCCGCC GAACGTGCTG CCAAAGCCGC CGAACGCATG
AGCGTGCGGA TCAACGAACG CGAATATCGC TTTGATCCTG AGCGAATCGA GCGCTTGAAA
GAGCAAGCCA AGCGGGCTGC TGATGAAGGC ATTAGCCGTG CTTACGAAGC AATTGGCCAA
GCCTTGGGCA ATATCGAAAA AAATATTGCT AACCCAAATG CGCCCCGCCC GCCAGCCCCA
CCAGCGCCGC CAGCCGCACC GCATGCGCCT CAAGCACCAA ACGCCCCGCA TCGGGTGTCG
ATTAGCGAAG ATGATGGCTC TTCATCAAGC CAACATGTGG CCTACACTGG CGATACGGTG
CGGATTTCGC CAGAGCAAGC TGCCGCAGCC CAAGCTGCCA CCGCAGCCCC AGCCGAAACG
CCAGCCGTCG ATAAAACCCA AGAACGCTTG GCAATTTTGA AGATGGTGCA AAGTGGCAAG
ATTAGCGCTG ACGAAGCGGC ACTCTTGCTC GAAGCCTTAG GCTAA
 
Protein sequence
MQQRINVGMN PTVAVRICEG SLRVIAHDLP EAWFDVDEDE ISFRAQEGHL SIERCSDDLE 
LRLPHGAVLT IDVVQGDVDL TGLSAVHTRQ IEGDISARDV QTFESDSVHG DASFTKSAKL
SLANIDGDLK IYENTDTVVI KNVNGDAKIS QAHNLTIINV NGDLSASDLF GDVAITRVSG
DATLRGAIKS LAPIHVDGDL KLGINWLPEQ VYRASANGDV VLEVADDANL SVNGFVQGDV
SGMGDREPGS ISLTWGTGTA RLELNVQGDL SIRGGAASHS HSKSVGGTSW NTNWNWNNDD
FNRAIRDFTD DLASMGRDIA AQFREMSRDW RDGKGERTAE RARQATERAA ERAAKAAERM
SVRINEREYR FDPERIERLK EQAKRAADEG ISRAYEAIGQ ALGNIEKNIA NPNAPRPPAP
PAPPAAPHAP QAPNAPHRVS ISEDDGSSSS QHVAYTGDTV RISPEQAAAA QAATAAPAET
PAVDKTQERL AILKMVQSGK ISADEAALLL EALG