Gene Haur_3541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3541 
Symbol 
ID5735400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4456122 
End bp4457477 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content49% 
IMG OID641280688 
Producthypothetical protein 
Protein accessionYP_001546305 
Protein GI159900058 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000377942 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTTTAC GTCGGATTTT AGGCATCGTC ACTGCGGCTA TCGCTGTCGT GAGTTTAGCT 
GTTCAACCTC GTTCAACTGC GGCTGCAGAA CCAATCAAAC TTGATACCCA ATGCTTTGAT
GTACCAGGGA TTATCAACTG TTTAGATGAT AAATTTCTGA GCTATTGGCG CAGCAATGGT
GGTTTGCCAG TGTTTGGCTA CCCAATTACT GCGGCTGCCA ATGAAGTTAA CCCCGATACC
CAGCAAAGCT ACTTGACACA GTGGCTCGAA CGTAATCGGT TTGAATTACA CCCCGAAAAT
GCAGGTACGC CTTACGAGGT GTTGTTGGGC TTGTTGGGCA AAGAACGCTT GACCCAACTT
GGCCGTGAAA TTGAGCCTCG CGAAGCTGGC CCAGTCGATG GCTGCTTATG GTTCGAACAA
ACTGGCCACA ATGTCTGTGA TCAAGCAGGT AGTTTGGGTT TCAAGAGCTA TTGGCAATCG
CATGGCTTGA AAATTGATGG CCTAGACAAT TATGCTCGTT CATTGCAATT GTTTGGTTTG
CCCTTGACCA GCGCTAAGAG TGAAACCAAT GCCAACGGCG ATACAGTTGT CACCCAATGG
TTTGAACGTG CTCGCCTCGA ATGGCACCCA AGCAATCCCG ATGAATTCAA GGTGCTCTTG
GGCTTGCTCG GTAAAGAAAT TATCGATGGC CGTAGCCAAC CAACTCCACC AACGCCAATT
GATCCTTGTG CTTCAACCCC TGATCCAGTG TCAGCTCGCG TGCGCCCAGC CAAATGTGGT
GAGCAAGGCA CCGAGTTTTC GTTTGATTTC TATGGTTTCA AGGCCAGCGA AGAGGTTGGC
TTCTGGATTA CCAATCCCGA TGGGATCAAT GTTGGGACAC GGCAAACAGC GAAGGTTGGC
CCAAACGGGA GCATTAGTGG CCTCCCATTC GATAGCCGCG ATGCCACACC TGGCACTTGG
CAATTTACCA TGCAATCAGC CTATCAAAGC CATCAGGCTA TTGTCTATAT TACAGTAATT
GCCAAGGCTC CCCAGCCAAC CCCAAACCCA AGCAACTGTA CTAGCACGCC TGAACCAGTT
TCAGCGCGAA TTAGCCCAGC AAAATGTGGT CCAGCAGGCA TGGTCTTTAT CTTCGATGTA
TTTGGGTTCC AACCCAACGA ACAAGTTGGC TTCTGGATCA CTAATCCCGA CGGAATTAAT
GTTGGGATTG CCAATACTAT GAATATTGGC CCCGAAGGTG CAATCTCAGG GATCGAGTTC
CCAACTGATG GTTTTACTCC TGGCACATGG CAATTTACCA TGCAAGGGAC GACCAGCAAT
CACGCTTCAA TCATCTACTT TACGATTACC GAATAA
 
Protein sequence
MVLRRILGIV TAAIAVVSLA VQPRSTAAAE PIKLDTQCFD VPGIINCLDD KFLSYWRSNG 
GLPVFGYPIT AAANEVNPDT QQSYLTQWLE RNRFELHPEN AGTPYEVLLG LLGKERLTQL
GREIEPREAG PVDGCLWFEQ TGHNVCDQAG SLGFKSYWQS HGLKIDGLDN YARSLQLFGL
PLTSAKSETN ANGDTVVTQW FERARLEWHP SNPDEFKVLL GLLGKEIIDG RSQPTPPTPI
DPCASTPDPV SARVRPAKCG EQGTEFSFDF YGFKASEEVG FWITNPDGIN VGTRQTAKVG
PNGSISGLPF DSRDATPGTW QFTMQSAYQS HQAIVYITVI AKAPQPTPNP SNCTSTPEPV
SARISPAKCG PAGMVFIFDV FGFQPNEQVG FWITNPDGIN VGIANTMNIG PEGAISGIEF
PTDGFTPGTW QFTMQGTTSN HASIIYFTIT E