Gene Haur_2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2950 
Symbol 
ID5734822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3723119 
End bp3724324 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content54% 
IMG OID641280094 
Producthypothetical protein 
Protein accessionYP_001545716 
Protein GI159899469 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCGGA CTATCTCGCT TGATCAAGCG CGGGCGATTG CAATTGCTGC CCAAGGCTTA 
GATCGTCGTC CCAACACTGT TGATCAAACC ACATTGCAAC AAACCATCCA GCGCATGCAA
ATTGTCCAGA TCGATACGAT CAATGTGGTA GCACGTGCGC CCTACTTTGT GCTCTGGAGC
CGTTTGGGTG ATTATGATCC TGCTTGGTTT GAGCAATTGC AATATCCGGC GGGTCAATTA
TTTGAATATT GGGCACATGC AGCCTCGTTC TTGCCGATCG AATTATTTCC CTTGCTGCGT
CCAGCGATGT TGCGCTATAT CCACGATTGG CACAGGTCGC GGCGTTGGCT CGAGGAAAAT
AAGGCGGTGG CTGATGGTGT GCTGGCCATG ATTCGCGAAC GTGGCCCGTT GCGTTCAGCC
GATTTTGAAG CCCCAGCCGA TCATGCTGGC GGTGGCTGGT GGAATTGGAA ACCAGCCAAA
TCGGCGCTCG ATATCCTCTG GGGCATGGGC GAATTGATGA TTGTGCGCCG CGAGAAATTC
CAACGGGTTT ATGAATTGAC CGAGCGGGTT GTGCCCGATT GGGATGATCG CGATGTGCTT
AGTTTTGAGG CAGCCGCCGA GCAATTGAGT GAGCGTGGCT TGCGGGCGAT GGGGATTGCC
ACCGAGCGGC ATCTGCCCGA TTACTTTCGC CAACGCCGAC CAGGCATCAA AGAACGCTTG
CAAACTTGGG CGGCTGAAGG CAAGGCAATT GAGGTGCATG TGGAAGGTTG GAAAACTCCA
GCCTATATTC ATCATGACCA GCGCCATTTG TTGGAGCAAA CACCCACGCC CAGTCTGACG
ACGATGCTTT CGCCCTTCGA TAACTTAATT TGGGATCGCC AACGCACGCT GGATTTTTGG
CAGTTTGACT ATCGGCTTGA ATGCTATACC CCAGCGCCCA AGCGACGCTA TGGCTATTTC
ACCTTGCCAA TTCTCTGGAA AGATCGAGTA GTTGGGCGGA TTGATGCCAA GGCTGAGCGC
AGTGCTGGGA TTTTTCGGGT GCATGCCCTG CATTTGGAAC CAAACATCGA GCTAAGCGAA
GCTTTATACG ACGATCTGGC CGTGATGCTG CGTGATTGTG CCAATTGGCA TCGCACGCCA
ACCATCAGCA TTACAATGAG CGATCCGCCG CAGGTGGCTG ACGAAGTGCA AGCCCGCGTG
AGCTAG
 
Protein sequence
MTRTISLDQA RAIAIAAQGL DRRPNTVDQT TLQQTIQRMQ IVQIDTINVV ARAPYFVLWS 
RLGDYDPAWF EQLQYPAGQL FEYWAHAASF LPIELFPLLR PAMLRYIHDW HRSRRWLEEN
KAVADGVLAM IRERGPLRSA DFEAPADHAG GGWWNWKPAK SALDILWGMG ELMIVRREKF
QRVYELTERV VPDWDDRDVL SFEAAAEQLS ERGLRAMGIA TERHLPDYFR QRRPGIKERL
QTWAAEGKAI EVHVEGWKTP AYIHHDQRHL LEQTPTPSLT TMLSPFDNLI WDRQRTLDFW
QFDYRLECYT PAPKRRYGYF TLPILWKDRV VGRIDAKAER SAGIFRVHAL HLEPNIELSE
ALYDDLAVML RDCANWHRTP TISITMSDPP QVADEVQARV S