Gene Haur_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1518 
Symbol 
ID5733405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1770341 
End bp1772545 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content51% 
IMG OID641278658 
Productcoagulation factor 5/8 type domain-containing protein 
Protein accessionYP_001544290 
Protein GI159898043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000763318 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACTC ATTTTTGGCG TAAATTGGCC GCAGTTGGCT TAGCCTGTAG CATTTTGTTT 
ACCTTTATCA CGACGGCTCC TCAAGCGGCA TTTGCCGCCG CCCCGCTTGG ACAAACGATT
TGGTTGCGGG CAATTTCAAG TGGGAAATAT GTTTCAGCCG ATGCCAATCG CGGGGCCAAT
TCGCCCTTGG TTGCTGATCG TGATACTGCC AATGGTTGGG AACAGTTTCA GGTGGTTGAT
GCTGGCAATG GCTATGTTGG CCTGCGGGCG TTGGCGACTG GCAAGTTTGT TTCAGCTGAT
CAAAATTTTG CTAACACACC ACTCGTTGCC GACCGTAACA CGATTAGCGG TTGGGAACAA
TTTCAATGGG TCGATGTTAC GGCAGGCCAA GTGCAATTAC GCTCGATTGG CAATAATAAT
TTCGTTTCCA GCGATCTTAA TCTTGGCACC CACGCACCAT TGGTTGCCAA CCGACCAACA
GCTTCGGGCT GGGAAACCTT CAATTGGGGC GTGGTTGGCA CAAATCCAAC GCCTAACCCA
ACCACCCCGC CGGGCACAAC CCCCGATTTT GGCCCGAATG TATTGATGTT TGATCCATCG
ATGTCAACTG CTTCGATTCA AGCTCAAATC AACAATGTCT ATGGCATTCA GCAAAATAGC
CAATTTGGCT CAGCTCGCTA TACCTTGATG TTCAAACCTG GCACCTATAA CGGCTTGAAC
ATTCCGGTTG GCTTTTATAC CCAATTGCTG GGCGTTGGCG CATCGCCGGA TAGCGTCAAC
ATCAACGGTA ATGTTTATTC AAATGCCTAC TTGGGCAACG ATAATGCTAC CTGTAATTTC
TGGCGCGGCG CTGAAGGCCT TGCGATTACG CCTTCGAATG GCACGATGCA ATGGGCAGTC
TCGCAGGCTG TGCCATTCCG CCGAATGCAC ATTCGCGGCA ATATGAAACT CAATCAAAAT
AATGGTTGGT CGAGCGGCGG CTGGATGTCG GATGTCTTAG TTGATGGCAA CGTCAACTCT
GGCACCCAGC AACAATGGAT TTCGCGCAAT ACCCAATGGG GCAGCTGGAC TGGCTCAAAC
TGGAATATGG TGTTTGTTGG CGTGACCAAC CCGCCAGCAG GCAGTTGGCC CAACCCACCA
TACACCAAAA TTGCCCAAAC CCCGATTGTG CGTGAAAAGC CATTTGTCAC CGTCGATGCT
GCTGGCAATT GGGGCGTGCG GGTTCCCTCG TTGCGCACCA ACAGCACTGG CATCACATGG
GCTGGTGGCT CAACGCCAGG CACGACCATC GCCATGAGCC AATTCTTTAT CGCCAAACCA
AGCGATAGTG CCGCAACGAT CAATGCCCAA TTAGCCGCAG GCAAACATCT ATTGTTTACG
CCTGGAATCT ATGCATTAAA TGATACGATT CGCGTGAATA ACCCCAATAC GGTGGTGTTG
GGCTTGGGCT TTGCAACCTT GCGCCCAACC ACGGGCTTGG CCGCGATGAC TGTCGCTGAT
GTTGATGGCG TGACGATCGC TGGCGTATTG TTTGATGCAG GCCTGATTAA CTCGCCAGTT
TTGCTGGAAG TTGGGCCAAA TGGCAGTAAT GCTAGCCATG CTGCCAACCC AATTGTCTTG
CACGACGTGA TTTTCCGCGT TGGTGGGGCC GCCGCTGGCA AAGCCTCAAC CAGCTTCCGA
ATTAATAGCC ACGACACGAT TGTTGACCAT ACTTGGGTTT GGCGGGCCGA CCACGGCGAC
GGCGTGGCCT GGAATAGCAA TACAGGTGCC AATGGAGTGA TCGTCAATGG CAATAATGTC
ACGATCTACG GCCTGTTTGT CGAACACTAT CAACAATATC AGGTACTTTG GCAAGGCAAT
GGTGGCCGCG TCTACTTCTA TCAATCGGAA ATTCCCTACG ATCCGCCAAC TCAAGATAGC
TGGCGCAGTG CAGCGGGAGT CAACGGTTGG GCCTCATACA AAGTTGCCGA TAATGTAACC
AGCCACGAAG CTTGGGGACT CGGCATTTAC AGCGTTTTCA CCAACCCTAA CATCTGGCTA
GCTCGGGCCA TCGAAGCACC AAACAACCTC AATGTTCGCT TTCACAACAT GATTTCAGTG
GCAATTGGGG CCAATGGCGG CATTAGCAAC GTGATTAATA ATACTGGTGG CTCAACCCAA
CCAAATGTCA CCTACACTCC GAAAGTAACC AATTACCCGA ATTAA
 
Protein sequence
MDTHFWRKLA AVGLACSILF TFITTAPQAA FAAAPLGQTI WLRAISSGKY VSADANRGAN 
SPLVADRDTA NGWEQFQVVD AGNGYVGLRA LATGKFVSAD QNFANTPLVA DRNTISGWEQ
FQWVDVTAGQ VQLRSIGNNN FVSSDLNLGT HAPLVANRPT ASGWETFNWG VVGTNPTPNP
TTPPGTTPDF GPNVLMFDPS MSTASIQAQI NNVYGIQQNS QFGSARYTLM FKPGTYNGLN
IPVGFYTQLL GVGASPDSVN INGNVYSNAY LGNDNATCNF WRGAEGLAIT PSNGTMQWAV
SQAVPFRRMH IRGNMKLNQN NGWSSGGWMS DVLVDGNVNS GTQQQWISRN TQWGSWTGSN
WNMVFVGVTN PPAGSWPNPP YTKIAQTPIV REKPFVTVDA AGNWGVRVPS LRTNSTGITW
AGGSTPGTTI AMSQFFIAKP SDSAATINAQ LAAGKHLLFT PGIYALNDTI RVNNPNTVVL
GLGFATLRPT TGLAAMTVAD VDGVTIAGVL FDAGLINSPV LLEVGPNGSN ASHAANPIVL
HDVIFRVGGA AAGKASTSFR INSHDTIVDH TWVWRADHGD GVAWNSNTGA NGVIVNGNNV
TIYGLFVEHY QQYQVLWQGN GGRVYFYQSE IPYDPPTQDS WRSAAGVNGW ASYKVADNVT
SHEAWGLGIY SVFTNPNIWL ARAIEAPNNL NVRFHNMISV AIGANGGISN VINNTGGSTQ
PNVTYTPKVT NYPN