Gene Haur_1814 details


Gene Information

Locus tag: Haur_1814
Symbol:
ID: 5733672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2109001
End bp: 2110665
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 47%
IMG OID: 641278957
Product: FG-GAP repeat-containing protein
Protein accession: YP_001544585
Protein GI: 159898338
COG category:
COG ID:
TIGRFAM ID:
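
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the terminal stop codon


length = gene_length(2109001, 2110665)
print(length, protein_length(length))
```

Under these assumptions the values agree with the record: 1665 bp and 554 aa.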


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000122942
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACCA CAATGGTTTT TCGGCGTGCC GTGCTGGCGA GCATCGGTAC AATTGTCGTG 
AGCAGTTTAT TGGTAGTTAA TGCCAATAGC CGAGGCTTGC TGGCCCAAAC GCTGCCTAAT
CCAATTGATT TTCATCCAGT GGTGACCTAT AGTTCACCAA GCCGCCCATG CGAATTGGGT
CGTGGTGATT TCAATGGCGA TGGTTTTGTC GATCTAGCGA CGGCAAATCA GGCAAGTAGC
GAAGTTCAAA TATTTTTAAA TAATGGAGCC GGCGCTTTCC CAACGCACAC CACGTATAGT
GTTGCTACTC CTTGTGGCAT CGATGTCGGT AATGTTGACG GTGATAACGA TTTAGATATT
GTCGTGACCA AGCAAACCTC AAATCAACTA GGTGTATTGC TCGGTAATGG CGATGGCACG
TTCCAAATTG CCCAATCGTT TAGCACTGGT GCACGCCCAA CCGACGTAAT CTTGCGTGAT
CTAAATCAAG ATACTGAACT TGATGCGGTT ATAACCAACC AAGATAGTCA AGGGGTTAGT
ATTTTGTTGG GCAATGGCAA TGGAACCTTT GCTAATCAAA CGATCTACAC GGTTCAGGCT
TCGCCAACGC TTGAGGCAGT TGGTGATTTA ACTGGCGATG GGTATGCTGA TGTGGTTGTG
GCCAATGCTG GTAGTGATAG TGTCAGCGTG TTGATTAATA ATGCCAATGG TACGTTTAGT
TCTGCGGTTC ATTATGGGGT TGGCAATATT CCCCATAGCG CTGGGATTGG CGATATTGAT
GGCGATAATG ATAACGATAT TATCGCGGTT AATCGTTGGG AACAAAGTAT GACTCGCTTG
ATCAATAATG AGAGTGGTAG CTTTACGCCG CTTGCTCCAA CGATTTTCTT GCAAGGCCCA
AGCGATATTG AAATAACTGA TCTTGATGGA GATGGCGTGC TGGATATTCT GGCAACCAAT
ACGGTTAATG ATGTTGATCC TGGCACGGTC AGTATTTATT TTGGCTTAGG CAATGCCAAT
TTTAGCAGCC CACAACTGGT AACCTCAGGC GTGCACCCAA CGTCGTTAAT CTATGCCGAT
TTGAATAATG ATGGTTTGGT TGATATTGCG ACTTCAAACT TTTATGGCAA TAGTATCAGC
GTTTTGTTAC GGCGAGTTCC TGCTGCAACC AGCACGCCAA CCATAACCCC GACGGCCACG
AGCACACCAA CCGCAACCAA TACACCAACG GCCACGCCAA CGAGCCAACC AAGCGTGACT
CCGGTTGCTG GTAGTTCAAC GACCTTTTTG CCGTTGGTTA CTGATAGTCG CCCAATCTTC
CCAATCGTGA TTAATGCCGT GGCTCAGCCG TTGATTCCAA TCACTCAACA AGGCCAAATT
TATTACACCA CGACATTAAC AATCAATACT CCATTGCCAA CGACTGGGCG CTTCTATCTT
TCATCGCGTC CTGACGCGAT TGCCGAAGTT CGGGTTGATG ATCAAATGAC GGTTTGGGCT
GATAATGCGG TGTTGTACGA ACGTAGCTTA ACAACGCCCC AAGTTGTTGA GATTTCGCGC
AGCGAGTTGA CATCATGGCT TGATCAAGAG CTAACCATCA CCTTCCGCGA TGTAGCTGGC
TCGGTTTACG GCAATAGCGC GGTGTGGTTG ATTTGGGTTC CTTAG
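
The gene sequence is listed in blocks of 10 bases separated by spaces, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the sketch does not claim to reproduce the exact rounding behind the 47% reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGAAGACCA CAATGGTTTT TCGGCGTGCC GTGCTGGCGA GCATCGGTAC AATTGTCGTG"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Passing the full listing to `gc_content` gives the whole-gene value to compare with the GC content field.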
 
Protein sequence
MKTTMVFRRA VLASIGTIVV SSLLVVNANS RGLLAQTLPN PIDFHPVVTY SSPSRPCELG 
RGDFNGDGFV DLATANQASS EVQIFLNNGA GAFPTHTTYS VATPCGIDVG NVDGDNDLDI
VVTKQTSNQL GVLLGNGDGT FQIAQSFSTG ARPTDVILRD LNQDTELDAV ITNQDSQGVS
ILLGNGNGTF ANQTIYTVQA SPTLEAVGDL TGDGYADVVV ANAGSDSVSV LINNANGTFS
SAVHYGVGNI PHSAGIGDID GDNDNDIIAV NRWEQSMTRL INNESGSFTP LAPTIFLQGP
SDIEITDLDG DGVLDILATN TVNDVDPGTV SIYFGLGNAN FSSPQLVTSG VHPTSLIYAD
LNNDGLVDIA TSNFYGNSIS VLLRRVPAAT STPTITPTAT STPTATNTPT ATPTSQPSVT
PVAGSSTTFL PLVTDSRPIF PIVINAVAQP LIPITQQGQI YYTTTLTINT PLPTTGRFYL
SSRPDAIAEV RVDDQMTVWA DNAVLYERSL TTPQVVEISR SELTSWLDQE LTITFRDVAG
SVYGNSAVWL IWVP
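
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch that uses the standard codon-to-amino-acid assignments, on the assumption that translation table 11 uses the same assignments for elongation and differs from the standard code only in which start codons are allowed. The function name is illustrative.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(text: str) -> str:
    """Translate a nucleotide listing in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):  # an incomplete trailing codon is ignored
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAGACCA CAATGGTTTT TCGGCGTGCC"))  # → MKTTMVFRRA
print(translate("ATTTGGGTTC CTTAG"))                  # → IWVP
```

Both results match the first and last residues of the protein sequence listed above.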