Gene Haur_2268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2268 
Symbol 
ID5734155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2901520 
End bp2902761 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content49% 
IMG OID641279409 
Productextracellular ligand-binding receptor 
Protein accessionYP_001545036 
Protein GI159898789 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0458822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCAAC AACGCTGTTG GCTTCGACGC TTAGCTTATT TGATGATCAT CGGTTTGTTA 
GGCGGTTGTA TCAGCACCAG TGCCAACCAA CAACCGCTGG TGATCACCTT TGGCGCATCG
ATCTCGATTA CTGGCAAAAC CGCCAAAGAA GGCGAATATG TGCGTGATGG GTATCAATTT
TTTGTTGATA CCCTGAATGC CCAAGGTGGG ATTCTGGTCG GCGGCCAACG CTATCAGTTG
CGTTTACGTT ATTATGATGA TGAATCGAAC CTCGAACGCA CAGCTGAGCT GTATGAAAAA
TTAATCAATC ACGATCAAGT TGATTTTTTA TTGGGGCCAT ATGGCTCGGA TGCTACCAGC
GTTGCGGTAG CGATCGCCGA AAAATATCAT ATTCCATTGG TTTCGGGCCA TGGCTCGGCC
AGCAGCATTT ATGCCAATAA CTATCACTAT ATTTTCAGTG TGCAAACGCC CGCCCGCCAC
TACTTAAACG GAGTGATGGA TGCAGTATTG GCGGCTGACC CAAGCCTCAA AACGCTGGCC
CTGTTGAGCG AAACCGATTC GTTTTCGCAG GATGTCGCCC AAGGTGTGCG TGATTACGCC
CAACAGCGCG GCTTAAACGT GGTTTATCAT GGCGATTATC CCAGCGATGC GCGTGATGTG
AGTCATCATT TAAATATCAT TAAGCAACTT CAGCCCGATA TGTTGCTCGG TGCAGGTCAT
CTGCAAGAGG CTTTGTTAAT TGTCAAGCAA GCCAAAAGCC TCGATCTTAG CCCTAAAGCA
ATTGGATTAA GTGTGGGGCC ATTATTGCCG CAATTTCGCG CTAATTTACA ACATGATGCC
GATTATATCC TTGGCCCAAC CCAATGGACT CCTGCCCTCG ACTATCATGG CGATGATAGC
TGGCAAACCC CAGCGGCTTT TGCCCAAGCC TTTCGTCAGC AATACCCCCA ATATAAATCG
GTGCCCTATC AAGTTGCTGA GTCGGCGGCA TCATTGATCG TCTTTCAACG GGCCTTTGAG
CGGGCAGGAA CGATCGATCG CTTAGCGGTG CGCGATACAA TTAAAGGCTT AAAACTTGAT
ACTTTTTTCG GGCCGATTCA ATTTGACGCG CAGGGCGTAA ACAGCGAAAA GCCCATGGCA
GTTGAGCAGT TGCATCCTGA TGGTCAAAAA TATACGGTAT TTCCCCAAGC CGTGGCCGAA
CAACCACTGT TGTATCCCAT GCCCACGTGG AGTCAACGCT AG
 
Protein sequence
MWQQRCWLRR LAYLMIIGLL GGCISTSANQ QPLVITFGAS ISITGKTAKE GEYVRDGYQF 
FVDTLNAQGG ILVGGQRYQL RLRYYDDESN LERTAELYEK LINHDQVDFL LGPYGSDATS
VAVAIAEKYH IPLVSGHGSA SSIYANNYHY IFSVQTPARH YLNGVMDAVL AADPSLKTLA
LLSETDSFSQ DVAQGVRDYA QQRGLNVVYH GDYPSDARDV SHHLNIIKQL QPDMLLGAGH
LQEALLIVKQ AKSLDLSPKA IGLSVGPLLP QFRANLQHDA DYILGPTQWT PALDYHGDDS
WQTPAAFAQA FRQQYPQYKS VPYQVAESAA SLIVFQRAFE RAGTIDRLAV RDTIKGLKLD
TFFGPIQFDA QGVNSEKPMA VEQLHPDGQK YTVFPQAVAE QPLLYPMPTW SQR