Gene Haur_4010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4010 
Symbol 
ID5735871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5116857 
End bp5118188 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content48% 
IMG OID641281160 
Producthypothetical protein 
Protein accessionYP_001546770 
Protein GI159900523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTACA ATCAATGGAT CGATCAGCAA GTTGATAGTG ATTTTCAAAA AGCGCGATTT 
AGCGCGTTTT TGAATGCAAT AAAAGCACGT TTGCGCGACG AACCGCAAAC GCTCTTACCA
TTTGAAGAGG TTCGTGCTCG CCTCAATATT CGCAGCCAAA CCGATCGGGG CATGCAAATT
GTGCCGTTGC AATCGATTAT TGGCAGCGAA GGCCGATATA GCGATTTCGA CCGCCAATTT
TTGCCACGCC ACGAGGTAAC TAAAGCCCGT TGGAAAAACG TTGATCGGGC ACATTATAAC
GATGTGATGC TCCCGCCAAT CGAACTCTAT CAAATTGGCG AGGTCTATTT TGTGCGCGAT
GGCAATCATC GAGTGTCAGT CGCTCGCCAA CAAGGCCAAG ATTTTATCGA TGCCCATGTG
ATCGAATTGC TCAGTGATGT GCCAATCAAG CCAACCATGA CCCAAGATGA ATTGAATCAA
TTGGAAGAAC GCTCAGACTT TTTGGAATGG ACGAATTTGG CGCAGTTGCG GCCTGAAGCC
CAGTTTATCG AGCTAACCAC ACCTGGTGGC TATTTGGATC TGATTCGGCA TATCAACGGC
CATCGCTATT TCAAATCGTT GGAACTTGGC GAGGAGCCAA CCAGCGAAGA GGCGATTTTA
AGTTGGTACG ACAATATTTA TAAACCTTTG ATCGACGAAA TTTATGATAC TGGGATTTTG
GCGGCCTTCC CCGAGCGCAC CGCCACCGAC TTGTATCTGT GGATTATGGA TCATCGTCAT
TATCTGACCC AGACTGAGGG GATCGATCCA GGCCCGAAAA CTGCGGCGAT CGACTATACC
CGCCATTTTG GCGAGCGCAA AAATCGGGTT AAGTTACCTG ATCCGCCCAG CCATGCCGAA
CTAGAGTTTA TTCAGTGGAG CAAACTCAAT CTGCTGCGCC AAGATGTGCG TGTGCCGCTC
AGCAACGATA ACGATTATGC GCGGATCAAA TTGCATGTGA TCGATCATCA ATATTTTATG
GGCAAGGATC TTGATCGCGA AGTCACGTTC GAGGAAGCTG TACAAAGCTG GTACGATACG
GTGTATCGCC CAGTAACCCA AGCCATCGCT CAACAACATA TTGGTGAGAT GTTTCCACGC
CATACAATCG GCGATTTATA TTTGTTGATC ACCGATTATT TGCATATGCT ACGCGGTCAG
GGCGTGGAGA TTCAGCCATT ACAAGCGGCC CGCGAATATG CCGAACGCTT TGGTAGCGAA
CGTGGAGCCT TTCTGACAGG CGTGTTGCAT CGAGCACGGC GATTGATGAA ACGAGCCTTT
GCAACAACAT AG
 
Protein sequence
MGYNQWIDQQ VDSDFQKARF SAFLNAIKAR LRDEPQTLLP FEEVRARLNI RSQTDRGMQI 
VPLQSIIGSE GRYSDFDRQF LPRHEVTKAR WKNVDRAHYN DVMLPPIELY QIGEVYFVRD
GNHRVSVARQ QGQDFIDAHV IELLSDVPIK PTMTQDELNQ LEERSDFLEW TNLAQLRPEA
QFIELTTPGG YLDLIRHING HRYFKSLELG EEPTSEEAIL SWYDNIYKPL IDEIYDTGIL
AAFPERTATD LYLWIMDHRH YLTQTEGIDP GPKTAAIDYT RHFGERKNRV KLPDPPSHAE
LEFIQWSKLN LLRQDVRVPL SNDNDYARIK LHVIDHQYFM GKDLDREVTF EEAVQSWYDT
VYRPVTQAIA QQHIGEMFPR HTIGDLYLLI TDYLHMLRGQ GVEIQPLQAA REYAERFGSE
RGAFLTGVLH RARRLMKRAF ATT