Gene Haur_4127 details


Gene Information

Locus tag: Haur_4127
Symbol:
ID: 5735988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5274315
End bp: 5275685
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 52%
IMG OID: 641281281
Product: hypothetical protein
Protein accession: YP_001546887
Protein GI: 159900640
COG category:
COG ID:
TIGRFAM ID:
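
The coordinates and lengths in this record can be cross-checked against one another. The span from Start bp to End bp should equal the gene length, and for an unspliced CDS the gene length should be three times the protein length plus one stop codon. A minimal Python sketch of that check follows; the function names `span_length` and `expected_protein_length` are hypothetical helpers for illustration and are not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Protein length implied by a CDS length.

    Assumes the CDS is unspliced, complete, and ends in exactly one
    stop codon (an assumption made for this sketch).
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
print(span_length(5274315, 5275685))   # 1371
print(expected_protein_length(1371))   # 456
```

Both results agree with the Gene Length and Protein Length fields listed in the record.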


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCGAT TGTTATGTTT GCTGATCATT CTGCTGCTCT GGCCGCATCC TGTGGCTGCT 
CAAACCAATC TTCCTCAATG GCTTGAACGC AAAACTACCT ATATTTCTAT TTTGTATCCC
CAAGGCAGCG AAGCCGAAGC CGAGCGCTAT GCTGGCTATA GCGACGCGAT CTACGAGGAA
GTTACGGCGG TCGTTGGCTA TCGACCTGCC CCACCCTTGA CGCTGCGCAT CTACCCTACC
AAAGAACTCT ATCAACAGGT AAACCCGGCG GCCCGTTGGC TGGAAGGCAT CGTGGCTCAT
GCCCACACTG GGCGACGCGA AATTAGCATT GCTGTGCAGC AAACTGTAGG CATGAGCGAT
GAAGAATTAC GCAATAATGT GCGCCATGAA TTAATGCATA TCATTGCTGC CGAGCTTTCC
GATGGCCGAT TAAGTACGAT GTGGCAGGAA GGCATTGCCC AATATGTTGA AGTGCCAACC
AGCCAAAGCG GCTATAAAAT TGCCTTGCTC AAACAAGCCC TCGAGAATAA CGCGCTTGCG
ACCTGGCGCT TATTAGATAG TGCTGGCGCA GTCTACGATA ATCCAGAACT GGGCTATCCA
CAAAGCTGGT CGATGGTCTC ATTTTTGATT CAGCGTTATG GTATGGCACG ATTTTTGGCT
TTTTTAGAGG CGTTGCGCAC GGCTAGTGGC TATCGTTCAG CCCTCAGCCA AGCCTATAGT
CTTAGTGCCG AAAGCCTTGA AAGTGAATGG CTGGCTCAAC TGCCAACCTG GATCGATGGT
GGTTGGAAGC AAGCGCCGAG TGTCGCCTTC GATCAAGCGA GCATCGAAAC AGCTCTGGCT
GCTGGACGTT ATAGCGAGGC CTTGACTGCC GCCGAAACCG CCCTGACGAT CAAGGATGAT
CCGGCGATTG CGGCGCTGCG CGAACAAGCT CGCAAAGGCG TGCGAGCCGA AGATGCTGCT
GCCGCCGCCC GCGTGGCGCT ATTGGAAGGC CGTTATGCTG AGGCCAAAAC CGCGATCGAA
CAAGCGTTGC CATTATTTGC TGATTTGGCG CGAATTGATC GCCAAAAGCT GCTGAATGAT
TACCTACAAC GCGCTGACCA AGGCCTCAAA GCCCAGCAAC TACTCGAAAC CGCCCGCCGC
GATTTGAATG GAATTCGCAT AGTTGCTGCG CGTAATAATA TTGAGCAAGC TGCTAATTTA
TTTAGCCAGC TTGGCGATAA TAATGGGCGC AGCCAAGCCG CTCAGTTGCT TGAATCGCTT
AATCTACGGC TAAAAATTGT GGGAATTGGC TTGATTGTGG TGGTTGGGTT GGGTTTAGCG
TGGAATATTG ATCGGCGACG AGCTATGCGC AAGCGGATGT TGCCGCTGTA G
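
The record lists a GC content of 52% for this gene. As a sketch of how such a figure is conventionally derived from a sequence printed in blocks of ten, the Python snippet below strips whitespace and counts G and C bases. The helper name `gc_fraction` is hypothetical, and the definition used here (G plus C over total length) is an assumption; the database may compute or round its value differently.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) used for block formatting is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First two blocks of the gene sequence above: 8 of 20 bases are G or C.
print(gc_fraction("ATGCGCCGAT TGTTATGTTT"))  # 0.4
```

Passing the full gene sequence, including its spaces and line breaks, gives the fraction for the whole gene.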
 
Protein sequence
MRRLLCLLII LLLWPHPVAA QTNLPQWLER KTTYISILYP QGSEAEAERY AGYSDAIYEE 
VTAVVGYRPA PPLTLRIYPT KELYQQVNPA ARWLEGIVAH AHTGRREISI AVQQTVGMSD
EELRNNVRHE LMHIIAAELS DGRLSTMWQE GIAQYVEVPT SQSGYKIALL KQALENNALA
TWRLLDSAGA VYDNPELGYP QSWSMVSFLI QRYGMARFLA FLEALRTASG YRSALSQAYS
LSAESLESEW LAQLPTWIDG GWKQAPSVAF DQASIETALA AGRYSEALTA AETALTIKDD
PAIAALREQA RKGVRAEDAA AAARVALLEG RYAEAKTAIE QALPLFADLA RIDRQKLLND
YLQRADQGLK AQQLLETARR DLNGIRIVAA RNNIEQAANL FSQLGDNNGR SQAAQLLESL
NLRLKIVGIG LIVVVGLGLA WNIDRRRAMR KRMLPL
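
The protein sequence corresponds to a translation of the gene sequence under translation table 11, which assigns the same amino acids to codons as the standard genetic code. A minimal, dependency-free Python sketch of that translation is shown below. The names `CODON_TABLE` and `translate` are hypothetical; the sketch assumes the input is a complete coding sequence read in frame from its first base, and it does not model alternative start codons.

```python
# Codons enumerated in TCAG order, paired with their amino acids
# (standard code; table 11 uses the same codon-to-amino-acid mapping).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(seq: str) -> str:
    """Translate a nucleotide sequence up to the first stop codon.

    Whitespace used for block formatting is ignored. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCGCCGAT TGTTATGTTT GCTGATCATT"))  # MRRLLCLLII
```

The output matches the first ten residues of the protein sequence listed in the record.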