Gene Haur_0075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0075 
Symbol 
ID5731948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp97351 
End bp98661 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content51% 
IMG OID641277197 
Producthypothetical protein 
Protein accessionYP_001542855 
Protein GI159896608 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.258844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCCAT CGCGCCGTTT ATTAGCACTT TTGCTAGCAG GCTTGGTTCC GGTGATCATC 
GGAGCAGCAA CCCCCAGTTT GCGCTGGACA ATCTGGGTTT ATGTATTGTT GCTGATTGGC
TTTGTGGCGC TTGATTGGTT TATGACTCCC AAGCCTAAAT TGTTGGAAGT AGCGCGGATC
AACGAGCCAA AACTTTCAAT TGGCGAACAA AACCTGATCA CGCTGGCAGT GCATAATCAA
AGCCCGCGCA CGCTCGAAAT TCAAATTCGT GATGAGTTTC CGGTGGAGTT TCCCAGCGAT
ACGCTGATTC TTAAAACCAA GGTCGAGCCA GATACGGTGC AAGAGGTTAA CTACCATGTA
CGGCCCTTGC GGCGTGGTGA TTATCGCTTT GGCAATATCA ATTTGCGCTA TACCAGCACC
TTTGGTACGT TCTTGCGCCA AACCAAAATC GCCTTCGACG AATTGGTCAA GGTTTATCCC
AATGTGCTTG ATGTGCGCAA ATACGATATG TTGGCGCGGA AGGGCATGCT GTTTGAGTTG
GGGTTGCGCA CCGCCCGCGT GTTTGGCTCA GGTACCGAGT TTGAGCGCTT GCGTGAATAT
ACGCCCGATG ATGAGTTTCG CTCGATTAAC TGGAAGGCCA GCGCTCGCCG TAACAAGCTG
ATTGCTGCCG AATATGAGAC CGAGCGTTCG CAGTATGTAG TGTCGGTGAT CGACACTGGG
CGTTTGATGC GCCCAACAAT CAACGATATC GCCAAGCTGG ATTATGCGAT CAATGCCTCG
TTAATGCTGG GGTATGTGGC GATGCTCAAA GGCGATCACA TCGGCATGCT TTCGTTTGCC
GACCATGTTG GGCGTTTTTT GCAGCCGCGC CGTGGTAAAG CCCAGTTTTA TCAAATGTTG
GAGATGTTGT ATAACTTGCC ATCGCAGCCC GTCGAGGCCG ATTATGGCCG CGCGATCTCC
TACTTGGGCT TGAAGAATAA GCGCCGTTCG TTAATTGTGA TTTTTACCGA CCTCAGCACC
ATGGATACCG CCAAGCCGCT GATTCAGCAT ATGGCACGCT TGGCCAAAAC CCACCTCGCC
TTGTGTGTGG TGATGAGCGA CCCCAACTTA GTCGGCTATG CTGGCAAAGC GGCCTATAGT
TCCACCGATG TGTATGAACG CGCCGTGGCC GAGATGGTGC TTGATGAACG GCGGGTAGTG
CTCGACACGC TGAATCAAGC TGGCGTACAT ACGATCGACG TGCCAGCCAA CAAACTAACG
GTTTCGGTGA TTAATAAATA TCTGGAGTTC AAAGGGCGAG GGCTTATTTA A
 
Protein sequence
MIPSRRLLAL LLAGLVPVII GAATPSLRWT IWVYVLLLIG FVALDWFMTP KPKLLEVARI 
NEPKLSIGEQ NLITLAVHNQ SPRTLEIQIR DEFPVEFPSD TLILKTKVEP DTVQEVNYHV
RPLRRGDYRF GNINLRYTST FGTFLRQTKI AFDELVKVYP NVLDVRKYDM LARKGMLFEL
GLRTARVFGS GTEFERLREY TPDDEFRSIN WKASARRNKL IAAEYETERS QYVVSVIDTG
RLMRPTINDI AKLDYAINAS LMLGYVAMLK GDHIGMLSFA DHVGRFLQPR RGKAQFYQML
EMLYNLPSQP VEADYGRAIS YLGLKNKRRS LIVIFTDLST MDTAKPLIQH MARLAKTHLA
LCVVMSDPNL VGYAGKAAYS STDVYERAVA EMVLDERRVV LDTLNQAGVH TIDVPANKLT
VSVINKYLEF KGRGLI