Gene Haur_3015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3015 
Symbol 
ID5734902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3806687 
End bp3809194 
Gene Length2508 bp 
Protein Length835 aa 
Translation table11 
GC content51% 
IMG OID641280159 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001545781 
Protein GI159899534 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000302047 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTAT TTCATGGTCT TTTACTGATG AGCTTAGTCT TGCTAAGCTT CGGCGGTGCT 
GCTAGCAAAC CACCTAGCAC CACAATTCCA CCCAAGTTGT ATTTACAACG CGGCACGATC
GATCTCAAAG TGGTTAATCA AGCCAACCAG GCTGATCCAT TATTGCAGAC AGTTGGGGGG
TATGCGGTTA TTCAATTTAG TGGGCCTGTG TTGCTCAAGC AGCGGCAAGC ACTTGAAGCT
ACGGGGTTGA GCATTATTGA GTATCTCCCT GATTACGCCT ATTTAGTTCG TGGCTCAGCT
GCCCAACAAG CTGCTGCCAG CCGAATTGAT GGTTTTTATG CGCGTGGCGA TTGGACGTTG
GCCGATAAGT TTCATCCTGG GTTGTTGAAA CTGATGCGCA CTGGTGCGTA TCAAGGTTTG
GCGTTGCAGA CGATTGGTTG GGATAACCAA CTGACCACTG CTGAACAAGC AGTCAAAGCC
CAAGGCTTGA AGCTTGATGC CATTAGCACG ATCGATCAAC TGATTCAGTT GGCCCAAATT
AATGAAGTGC GCTGGATCGA AGTGGCGAGT ACGCCGAAAT TATTCGACCA ATATGCGCGG
CAAGTTCAGC AAGTTGAGCC TGTTTGGACT GATCGGCAGT TGTATGGCCA AAATCAAATT
GTGGCCTACA CTGACACAGG TTTGGATACT GGCTCCTTGA CAACCCTCAA CAACGATTTT
ACCAATCGGA TTCTTGCGAC CCAAGTGCTC TCGGCTGGGG GGCATTGGGA TGATAACCAT
GGCCATGGCA CGCACGTTGC TGGCTCGATT GCTGGTAACG GCGCACTTTC TGGCTCAAAC
CCAGCTACCC ATACCTATAC CAACTCGATG GCAGGGATTG CGCCTGAGGC CAAGTTGGTT
GTCCAAGCGT TTGAAGCAAC TGCGACCGGC GATATTATTG GCTTGCCAAC CGACCTCTAC
CCAATGTATC AGCAAGCCTA TGATGCTGGA GCACGGATTC ATAGCAATAG CTGGGGCGAT
GCAACCGGGC CAGTCAGTGA TACAGAAGCT GCATTTGGGG GCTACCCCTA TAATGCGCAA
CGCACCGACC AGTTTTTGTG GGAACACCCC GATTATACAA TGTTGTTTGG GGCTGGTAAT
AGTGGGGTCG ATGGAACCCC GAGCCAAGCA ATCTTTTGTA CTGGTGGTAA TGGCGTTGTT
GACCCGGACT CGTTGCTCGC ACCTGGGACT GCCAAGAATG TGATTACGGT TGGGGCTAGT
GAAAGCCCGC GCCCAACGGG TGGTTATACG GGTGTGCCTT GGCTTTTATT GAGTTTTTGC
TTTGCCACCG CGCCAATTAA CACGGATACT CTTTCTGATG ATGCTAATGG GATAGCGGCA
TTCTCATCAC GTGGGCCAAC TGATGATGGC CGGATCAAGC CTGATTTGGT TGCCCCTGGC
ACGAATATTC TTTCAACTCG CTCATATGGC AGTGGGGCTG GTGCCTTATG GGGTGTTCAT
GAAACCAATG CCAATTACCT TTATTCGGGT GGTACGTCGA TGTCCACTCC GTTGGTTGCT
GGTACGGTTG CCCTCATCCG CCAGTGGCTG GGTATCCAAG GCTTGCCTAA TCCTAGTGCC
GCAGTCATCA AATCAATTGT TCTGAATACG ACCGTTGATA TTGCGCCCGG CCAATATGGT
ACTGGTGCAA CTCAAGAAAT TCCTTACAAC CGCCCCAATA GTGTGGCTGG TTGGGGGCGT
AGCAATTTGA GTTTTATTAC CAAACCAGCG CCCTATCATT TGTGGGTTGC CGATCAGACG
ACTGGCTTGA ATACAGGCCA GATGGTAAGC TATAACCATA CTGCCAGCCA ACCCCTAACC
GTGTTGACCA ATACCCAACC GCTGCGAGTT ATGCTCAACT GGACTGATCC ACCAGCTTCG
TTGGCAGCCA CCCAACAATT GGTTAACGAT CTTGATTTGG TGTTGATTGG GCCTGATGGT
ACGCGCTATT ATGGCAATAA TCAGAGCACT GGCGATCGCA CGAATAATAC CGAAGGTGTG
ATCATTAATA ATCCCCAAAT TGGTGCATAT CAGATTGAAG TTACTGCCCA TAATGTACCT
ATTTCTAGCC AAGCCTATGG CTTGTCAGTC GCTGGCCCAT TGCGTGAAGC TACTGGCGGC
GGTACGCCAA CTCCAACCCC AACCGCTATA GCAACCGCAA CAAATACGCC AACCAACACG
GCGACCAATA CACCAACCAA CACGGCGACG AATACGCCAA CCAACACGCC AACCAATACG
CCAACCAATA CACCAACCAA TACACCAACC AACACGCCAA CCAACACGGC GACCAATACA
CCAACCAACA CGGCGACGAA TACGCCAACC AACACGCCAA CCAATACGCC AACCAACACG
GCGACCAATA CACCAACCAA CTCGCCAACC CCCACCAATC TGCCAACTGT GACCACGACA
GCGGTTACAA ACGAGTATGA TGTTTGGATA CCATGGGCGA GCAAATAA
 
Protein sequence
MRLFHGLLLM SLVLLSFGGA ASKPPSTTIP PKLYLQRGTI DLKVVNQANQ ADPLLQTVGG 
YAVIQFSGPV LLKQRQALEA TGLSIIEYLP DYAYLVRGSA AQQAAASRID GFYARGDWTL
ADKFHPGLLK LMRTGAYQGL ALQTIGWDNQ LTTAEQAVKA QGLKLDAIST IDQLIQLAQI
NEVRWIEVAS TPKLFDQYAR QVQQVEPVWT DRQLYGQNQI VAYTDTGLDT GSLTTLNNDF
TNRILATQVL SAGGHWDDNH GHGTHVAGSI AGNGALSGSN PATHTYTNSM AGIAPEAKLV
VQAFEATATG DIIGLPTDLY PMYQQAYDAG ARIHSNSWGD ATGPVSDTEA AFGGYPYNAQ
RTDQFLWEHP DYTMLFGAGN SGVDGTPSQA IFCTGGNGVV DPDSLLAPGT AKNVITVGAS
ESPRPTGGYT GVPWLLLSFC FATAPINTDT LSDDANGIAA FSSRGPTDDG RIKPDLVAPG
TNILSTRSYG SGAGALWGVH ETNANYLYSG GTSMSTPLVA GTVALIRQWL GIQGLPNPSA
AVIKSIVLNT TVDIAPGQYG TGATQEIPYN RPNSVAGWGR SNLSFITKPA PYHLWVADQT
TGLNTGQMVS YNHTASQPLT VLTNTQPLRV MLNWTDPPAS LAATQQLVND LDLVLIGPDG
TRYYGNNQST GDRTNNTEGV IINNPQIGAY QIEVTAHNVP ISSQAYGLSV AGPLREATGG
GTPTPTPTAI ATATNTPTNT ATNTPTNTAT NTPTNTPTNT PTNTPTNTPT NTPTNTATNT
PTNTATNTPT NTPTNTPTNT ATNTPTNSPT PTNLPTVTTT AVTNEYDVWI PWASK