Gene Haur_3145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3145 
Symbol 
ID5735017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3972667 
End bp3974553 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content50% 
IMG OID641280288 
Producthypothetical protein 
Protein accessionYP_001545910 
Protein GI159899663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGT CAGCCCGGTG GCGTGGGTGG TTTTGGCTTG GTTTAGGCCT AATCTGCCTA 
CTTTGGCCCA ACCTAAGTCA TGCCCAAAAT GCCGATGCCA TCACCATCAC TATCGATCAA
GTTGGCTTTG ATGCCCAAGG TCATGTAGCG AATGGTGGTT GGTATCCAAT TATTACAACG
ATCGAAAATA GCGGAGCCGA CCTTCAAGCC CAAGTTGTGG TTACCACAGG CTTTGGTCAG
GCCGATTTGC TGCAAAATGT CGATTTACCT GGTGGTTCGC GCAAACAAGT GCGTTTACTC
ATGCGGGCCA ATCTCAATCA AACCGTTGTC GAAATCAAAG TTATTGATGC TCAAGGCAAG
CAGTTAGCCC GCAATCGCAG TAATGTGCGG GTACACGATA GCCAAGAAAT TTTAGTAGGC
GTATTTGGCG GGCCAGCCTC AAGTTTAGCT GGCGCTTCAG TTCCTGGCCG CCTCACCACG
GTGATGCCGC TTGACCCAGC TCAACTGCCA AGCACCGATG GCGAATTGTT TAACTTTGCA
GCAATCGTGC TCCAAGATGT TCAACCGACT GCCGAGCAGG CCGCCGCTCT CGAACGTTGG
GTCGCAACTG GTGGAACATT GATCGTCAGC GGTGGCCCGA ATAGCGCCGA ACTGCCCAAA
GAACTAGCCT CACTCTTGCC TGCGAGTGTT AGCCGCAGCA ATAGTTCGGC GGTGTTAACC
ACGCTCAATG GGCGCAATAC TCCCGCTTTT GCCCAAATCA ACCTACGGGT TAACCAACTT
CAACCAACCG CCGATGCTAG CATATTCGGC GTTGGGGCCA ATAACGAAGC CTTAGTGATC
AGTCGCAAGC TGGGCATGGG TCAAATCTTA GTCACAGCCT TCAATCCCAG TGACTTACCC
GCCGAAGTTA ATGATCGGTT GGTTTGGCCA GTGCTCTTAC AACCCCAACT TTACCGCGAT
TGGAATGTAG CGCTCTCGCC ATGGTCAATC CAGATTCGCG GCACTGATCA AAATTTGCCT
TCAGTTTTAG GCTTGATGGG AATTTTGTTT GGCTACATTC TGCTGATTGG CCCGATCAAT
TACTTTATTT TACGACGTTT GGATCGGCGG GAATGGGCTT GGTTTAGTAT TCCATTGGTT
GTACTTGGCT TTGTGGGTAT TATGTATTTA GCTGGTGGCG ATTTACGTAC TGGCAATATC
AATGTCACAA CGATTAACAT TATCGATAGC CAACTGGGAG CCGATCAAGG CCGTCTAAGC
GTCAATTATG GCTTCAATGC AGGGCGACGC GGCGCATGGA ATGGCAGTAT TGATGCCAAT
TTAATTGCTG GCAACCAACC AATGCAAGGC TTTGGCGATG AGGGTTCGGG CACAATCGAG
CAAACCAACG ATGGCAAAAC CCGCTTGCCC AATTGGCAAA GCAACATCGG CCAAATGCAA
ACCCTCGCCG CGATTGGCTC AAGCGCAGTT CCCTACAATT TTGAGGTTAA AGTAGCCAAA
GCCAATTCGT GGGAAGGTGC AACCATTACC AACCGGAGCG AACGCAAGGT TGAATATGCG
ATTCTGTTCA ATGGCGAAGA AAGTATTATT TTGCCAGCAC TCGAACCAGG TGCTTCGATT
ACGATCGATA ACAGCCTTGA TCGCGTTATG CAAAGCAGCC CCTATCTCAA CAACAACGAC
CAATTAACCC AAGCCCTACA ATTGCTCTGG AATGCTGGCA ACGAACGCAC CAACTACAGT
GGACTGCCCA AAAATTCGCT GTATACCAAG CCGCATATCA CGGTGCTTGA TACCGAAGTG
CTCAACGAAA TTATGGTCGA TGGGGTCGCC GCTCCGCAAA AGAGCAGCAA TATCTATAAT
TTGTATGTTG ATTTGGAGCA ACGCTGA
 
Protein sequence
MAQSARWRGW FWLGLGLICL LWPNLSHAQN ADAITITIDQ VGFDAQGHVA NGGWYPIITT 
IENSGADLQA QVVVTTGFGQ ADLLQNVDLP GGSRKQVRLL MRANLNQTVV EIKVIDAQGK
QLARNRSNVR VHDSQEILVG VFGGPASSLA GASVPGRLTT VMPLDPAQLP STDGELFNFA
AIVLQDVQPT AEQAAALERW VATGGTLIVS GGPNSAELPK ELASLLPASV SRSNSSAVLT
TLNGRNTPAF AQINLRVNQL QPTADASIFG VGANNEALVI SRKLGMGQIL VTAFNPSDLP
AEVNDRLVWP VLLQPQLYRD WNVALSPWSI QIRGTDQNLP SVLGLMGILF GYILLIGPIN
YFILRRLDRR EWAWFSIPLV VLGFVGIMYL AGGDLRTGNI NVTTINIIDS QLGADQGRLS
VNYGFNAGRR GAWNGSIDAN LIAGNQPMQG FGDEGSGTIE QTNDGKTRLP NWQSNIGQMQ
TLAAIGSSAV PYNFEVKVAK ANSWEGATIT NRSERKVEYA ILFNGEESII LPALEPGASI
TIDNSLDRVM QSSPYLNNND QLTQALQLLW NAGNERTNYS GLPKNSLYTK PHITVLDTEV
LNEIMVDGVA APQKSSNIYN LYVDLEQR