Gene Haur_4072 details


Gene Information

Locus tag: Haur_4072
Symbol:
ID: 5735930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5198044
End bp: 5199459
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 53%
IMG OID: 641281223
Product: F0F1 ATP synthase subunit beta
Protein accession: YP_001546832
Protein GI: 159900585
COG category: [C] Energy production and conversion
COG ID: [COG0055] F0F1-type ATP synthase, beta subunit
TIGRFAM ID: [TIGR01039] ATP synthase, F1 beta subunit
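
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5198044
end_bp = 5199459
gene_length_bp = 1416
protein_length_aa = 471


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, the span 5198044 to 5199459 covers 1416 bp, and 1416 bp corresponds to 472 codons, one of which is the stop codon, leaving 471 amino acids.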


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00152005
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACTG GAAAAATTTT ACAAATTACT GGCGTGGTTA TCGACGCAGA ATTTCCTGCC 
GATGGCCTGC CACAAATTTA TAACGCGTTG GAAATTCCCT TGGGCGAAGG CCGCTCATCG
CTGATCTGCG AAGTCCAACA GCAGTTGGGT GATAGCGTGG TTCGCGCGGT CGCTATGTCC
ACCACCGACG GCCTTGTCCG TGGGATGGAC GTAATCGACA CTGGCGCACC AATCAGCGTG
CCAGTCGGCC CCGAAACCTT GGGTCGGGTG TTCGACGTTC AAGGTCGGCC AATCGACGGT
GAAGGCGCGG TTGGTACTAC CAAAACCATG CCAATTCACC GCCCAGCCCC AACCTTTGAA
GAACAGTCAA ACCGCGCCGA GTTGTTCGAA ACCGGCATCA AGGTTATCGA CTTGATCGCG
CCCTTCACCA AGGGTGGTAA AACTGGGGTG TTCGGTGGCG CAGGTGTGGG CAAGACCGTT
ATTATCCAAG AGTTGATCTC GAATATCGCT AAAGAACAAT CCGGTTACTC GGTGTTCGCA
GGCGTAGGCG AGCGCTCACG CGAAGGTAAC GACTTGATCC ACGAAATGAA GGATTCAAAG
ATTCCTGGCA CCGACCAAAC CGTGTTCGAT AAAACGGTGA TGGTGTTCGG TCAAATGAAC
GAACCACCAG GAGCACGCTT GCGGGTGGCG CTTTCAGCCT TGACCATGGC TGAATACTTC
CGCGAAGAAG GCCGCGACGT ACTCTTGTTC GTCGATAACA TCTTCCGCTT TACCCAAGCA
GGTTCGGAAG TATCGGCGCT CTTGGGCCGG ATGCCTTCAC AGGTGGGTTA CCAACCAACC
TTGGGTACCG AGATGGGTGA ATTGCAAGAA CGCATCACTT CGACCAAGAC TGGCTCGATT
ACCTCGTTGC AAGCCGTCTA CGTGCCTGCT GACGACTACA CCGACCCAGC TCCAGCGACA
ACCTTTGCTC ACTTGGATGC AACGATTTCG CTGGAACGCT CGATCTCAGA AAAAGGTATC
TATCCTGCGG TGGACCCACT GGCTTCAACC AGCCGGATTC TCGACCCCAA CATTGTGGGC
GAAGAACACT ACCGCGTCGC GACCGAAGTT CAACGGATGT TGCAACGCTA CAAAGACTTG
CAAGATATTA TTGCAATTTT GGGTGTCGAA GAATTGTCAG ACGACGACAA ATTGACAGTT
TCACGCGCCC GCAAGCTCGA ACGCTTCTTC TCACAACCAT TCGGCGTGGC TGAAGTGTTT
ACCAACATTC CAGGCAAGTA TGTCGCGGTT GGCGATACCG TCAAGAGCTT TGCCCGCGTC
TTAGCAGGCG AGTTCGACCA CATTCCCGAA AGCTTCTTCT TTATGAAGGG TGGCATCGAC
GATGTGGTAG CCGCTTACGA TGCGTCAAAG CAATAA
 
Protein sequence
MATGKILQIT GVVIDAEFPA DGLPQIYNAL EIPLGEGRSS LICEVQQQLG DSVVRAVAMS 
TTDGLVRGMD VIDTGAPISV PVGPETLGRV FDVQGRPIDG EGAVGTTKTM PIHRPAPTFE
EQSNRAELFE TGIKVIDLIA PFTKGGKTGV FGGAGVGKTV IIQELISNIA KEQSGYSVFA
GVGERSREGN DLIHEMKDSK IPGTDQTVFD KTVMVFGQMN EPPGARLRVA LSALTMAEYF
REEGRDVLLF VDNIFRFTQA GSEVSALLGR MPSQVGYQPT LGTEMGELQE RITSTKTGSI
TSLQAVYVPA DDYTDPAPAT TFAHLDATIS LERSISEKGI YPAVDPLAST SRILDPNIVG
EEHYRVATEV QRMLQRYKDL QDIIAILGVE ELSDDDKLTV SRARKLERFF SQPFGVAEVF
TNIPGKYVAV GDTVKSFARV LAGEFDHIPE SFFFMKGGID DVVAAYDASK Q