Gene Haur_1384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1384 
Symbol 
ID5733276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1598769 
End bp1600202 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content57% 
IMG OID641278522 
Producthypothetical protein 
Protein accessionYP_001544157 
Protein GI159897910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAATG CTGTCGAACT GGCAAAGTGC CATGCCGACC CTGCCTACTT CACCCACCAC 
TTCGGATTAA TCGACGATGC CCAAGGGTTG GGGGATGGCA GCGGCGCGAT GCCCTTTACG
CTCTGGCCTG CCCAAGTACA GGTGCTCTGG ACACTCTTAA TCCAGCGTTT GGTCTTGATT
CTCAAGGCAC GGCAATTAGG CATTAGTTGG TTGTGTTGCG CCTATGCGTT GTGGCTCTGC
CTCTTCCAAC CGGGCAAAGT GGTGCTGATC TTCAGCAAAG GCCAAGGCGA AGCCGATGAG
ATGCTGCAAC GGGTCAAGCG CTTGTATGAA CGCTTACCCG AGTGGATGCG TGAGGCCTCG
CCAGCGCTAG TCACGGACAA CACGACCGAA CTGGAATGGG CCAATGGCAG TCGGGTCAAG
TCACTCCCCG CGACCAAAGG GGCAGGTCGC AGCTTTACGG CATCACTGGT GATTTTGGAC
GAAGCCGCGT TCCTGCTCTG GGCAACCCAG TTGTATACCG CGCTCAAGCC CACGATCGAC
GGCGGCGGCC AACTGATTGT CTTATCGACG GCCAATGGGA TTGGCAATCT GTTTCATCAA
CTGTGGCTCA AGGCCATCAG TGCCAAAAAT CGCTTTACCA CGATCTTTTT ACCGTGGTGG
GCACGACCAA CGCGGGATGC GGCGTGGTAT CACAACCAGC TTGAGGAATA TACCGACCCG
GACATGGTGC GGCAAGAATA TCCCTCAACC GCGCAAGAAG CCTTTTTGGT GTCAGGGCGC
ACGCGATTCA AAATGCCATG GCTGCTGAAG CAAAATCCCA GCGATGGCCT TGCGCTCGAT
GAGCTTCCAG CATCGCTCTC GCTCCTGGAC GGCGTGACCG TTTACCACCT GCCGCAGCAA
GGACGACGCT ACATTCTGGC AGCGGACGTG GCCGAAGGGC TGGAGCACGG CGACTTCTGC
GCTGCCACCT TGATCGATGC GGTGTCGTGG GAAGAAATGG CCAGCGTCCA CGGCAAATGG
GAACCGGACG AATACGCTCG CATTCTCATG AATTTGTCAG ACGTGTACGG GGCCACGGTT
GCGGTTGAGC GGAACAATCA CGGTCACGCG GTATTGACCA CGATGAAACT GGCCGGATTC
ACCCGCATCG TGTATGGCCT TGATGGGCGA GCAGGGTGGC TCACCAATGC GCAGACCAAG
CCGCAAATGA TTGACTTGTT CGCCACGGCA CTGCGCGATG TGTTGGTCAC AATTCGCAAT
CAAACGGCGC TGAATGAACT GGCGATTTAT CGGATTTTGA AGAACGGCGG GACGGGCGCA
CCCGCAGGCT ATCACGATGA TTTCGTGATG GCATGGGCCA TTGCCCTGAT GGTTGCCAGT
CAACCAATGG AAGTCGAAGA CGAAGCCATG GCTGGCTCGT GGAATAGCTA CTAG
 
Protein sequence
MSNAVELAKC HADPAYFTHH FGLIDDAQGL GDGSGAMPFT LWPAQVQVLW TLLIQRLVLI 
LKARQLGISW LCCAYALWLC LFQPGKVVLI FSKGQGEADE MLQRVKRLYE RLPEWMREAS
PALVTDNTTE LEWANGSRVK SLPATKGAGR SFTASLVILD EAAFLLWATQ LYTALKPTID
GGGQLIVLST ANGIGNLFHQ LWLKAISAKN RFTTIFLPWW ARPTRDAAWY HNQLEEYTDP
DMVRQEYPST AQEAFLVSGR TRFKMPWLLK QNPSDGLALD ELPASLSLLD GVTVYHLPQQ
GRRYILAADV AEGLEHGDFC AATLIDAVSW EEMASVHGKW EPDEYARILM NLSDVYGATV
AVERNNHGHA VLTTMKLAGF TRIVYGLDGR AGWLTNAQTK PQMIDLFATA LRDVLVTIRN
QTALNELAIY RILKNGGTGA PAGYHDDFVM AWAIALMVAS QPMEVEDEAM AGSWNSY