Gene Haur_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1064 
Symbol 
ID5732968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1216770 
End bp1217891 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content49% 
IMG OID641278199 
Productextracellular solute-binding protein 
Protein accessionYP_001543840 
Protein GI159897593 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTAC GTCGCTTTGG GCTGGTGCTC ATGGTGTTGA GTTTGTTGGT GGCCTGTGGC 
GAAGCCAGCT CAACCGCTGT TCCGACAACT GCGCCTGCTG GTACGGCCAC TGGCTCGAAT
GCTGGCTCGG TTGATCGCAG CAAATTGAGC AAAACCTTGC GAATTTATGC GTGGGGCGAG
TATGTTCCCG AAGATGTGCC ACAACTGTTT GAGAGTGAGT TTGGGGTCAA GGTTACGGTT
GATACCTATT CATCGAACGA AGAAATGGCC GCCAAAATTC GCGCTGGCAA TTCAGGCTAC
GATTTGATTC AACCTTCAGA TTATATGGTG GCGTTGTTGG CCGAGGGCAA TTATTTGGCC
AAAATTGATT TGGCTAATAT TCCCAACATT GCCAATATCG ACCCAGCCAA TATGGGTTTG
TATTACGACC CAAACAATGA ATTTTCAGTA CCATACCTTT GGGGAACCAC GGGGATTGCC
TACGATAAAA CCGCAGTTTC ACCAGCCCCA ACTAGCTGGT CGATTTTGTT TGATCCAGCG
CAATTGAGCG CCTACAAAGG TCGCGTGAGC ATGCTCAACG ACGAGCGCGA GGTAATTGGC
GCGGCTATGC TGTTCTTGGG CAAAGATCGC AATTCCAGCG ATGCGGCGGA TCTCGAAGCC
GCCAAAAAGG TTTTGATTGA ACAAAAGCCA TTGCTAGCCA AATATAACAG CGATAATGTT
TATCAAGATT TGGCTTCGGG CGAAGTGGTT TTGGCCCAAT CGTGGAATAA TTACACGGGT
TTGGCCATGA TCGATAATGA AAACATCGAG TGGGTGATTC CGCAAGAGGG TGGCGTGATT
TGGCAAGATA CCATGGCGAT TGTGGCTGGC ACGCCCAACC AATATACTGC CGAAGTATTC
ATTGATTTTA TGAATCGGCC AGAAATTGCC GCCAAAGTTG CCGACTTTAC TGGTGCTTTG
ACTCCCAACG TCAAGGGCGA ACCATTGATC GGCGACGATC TCAAGGCTGT CTATCCCAAG
ATCAAACCGA GCGCTGAAGA TCGCAAACGG CTTGATTGGT TGCGTAAAGG CCAAAATGCG
ACGGCCTTCT CCGATGTGTG GTCGGCGGTT AAATCGCAAT AA
 
Protein sequence
MKLRRFGLVL MVLSLLVACG EASSTAVPTT APAGTATGSN AGSVDRSKLS KTLRIYAWGE 
YVPEDVPQLF ESEFGVKVTV DTYSSNEEMA AKIRAGNSGY DLIQPSDYMV ALLAEGNYLA
KIDLANIPNI ANIDPANMGL YYDPNNEFSV PYLWGTTGIA YDKTAVSPAP TSWSILFDPA
QLSAYKGRVS MLNDEREVIG AAMLFLGKDR NSSDAADLEA AKKVLIEQKP LLAKYNSDNV
YQDLASGEVV LAQSWNNYTG LAMIDNENIE WVIPQEGGVI WQDTMAIVAG TPNQYTAEVF
IDFMNRPEIA AKVADFTGAL TPNVKGEPLI GDDLKAVYPK IKPSAEDRKR LDWLRKGQNA
TAFSDVWSAV KSQ