Gene Haur_2397 details


Gene Information

Locus tag: Haur_2397
Symbol:
ID: 5734278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3053669
End bp: 3055462
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 49%
IMG OID: 641279538
Product: extracellular solute-binding protein
Protein accession: YP_001545165
Protein GI: 159898918
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
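
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3053669
end_bp = 3055462

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which codes for no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1794 597
```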


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTAC GGTTTCCTGC GTTGATGTTG CTGGTTATTA TCTTTACCAG CATGTTGGCC 
GCATGTGGTG ATGCGTCAAC CCTCACACCA CAAGCAACTC AAGCACCAAG TACAACCGCC
CCAGCCGCTA CTGCAACAGA AGCTCCTGCC ACGGGCAGCA CCGATCCAAC CGCCGAACCA
ACTGCTGCTG CAACCACAGA TACCGGTTCC ACGAGCAGTG ATGGCAAACT ATTTTTGACG
GTTTCAGACC AACAACAATC AACTTGGGTG CGCAATTTCA ACCCATTTGC TAGTGATAAC
CGCTGGCCAA CTGCCGCCGG GATTTACGAG CCAATGTTTA TTTACAACAT TGCCACCGGT
AAAATCGAGC CATGGCTCGC CACCGAATGG GCTTGGAATA GCGATAACAC CGAGTTGACC
TTCACTATCC GCGATGGCGT GAAGTGGTCA GACGGCGAAG CATTTAGCTC AAAAGACGTT
GCCTACACGC TCAATTTGAT GAAAGAAAAC GAAAAGCTGC AAGGCAATGG CCGCGCTGCA
ATGCGTTTTG TTGATAGCGT CGCTGCCGAT GGCAATAAAG TTGTCGTGAA GTTCAGCGAA
GTTTCAACCA TCGCGCTGTA TGATATTGGC CATCAAATGA TTGTGCCAGA ACATATCTGG
AGCAAGATCG CTGATCCAGT GACCTTCACC AACGAAACTC CAGTTGCAAC CGGTCCATTC
ACCGAAATCA TCCGCTTCCA AGACCAAATC TGGGAACTTG GCAAGAATCC AAACTACTGG
CAAGCTGGCA AACCATATAT CGATGGTATT CGCCAACCAG CCTACCCCAG CAACGATGCC
GCCAACTTGG CAACGATCAA TGGCGAAAAC GACTTGGCTA GCAACTTCAT TCCTGATGTT
GAAAACACCT ATGTCGCCAA AGATCCTGAA AACAACCACT ACTGGTTCCC ACCAGTCAAT
GCGCCAGTGA TGTTGCTCTT GAACACCACC AAGGCTCCAT TCAACGATCC TAACGTGCGC
AAAGCGATCA GCATGGGCTT CGATCGCCAA CAAATTACTG AAATTGCTAC CTACAGCTAC
AACGGCCCTT CAGATGCAAC TGGTTTGCCT GAATCGTTTG CTGATTGGAA AAACCCTGAA
GCCGTTGCTG CTGGCGATTG GGTCAACTTT GATGTTGAAA AAGCCAATGC AATGTTGGAT
GCTGCTGGCC TGACCCGTGG CGCTGATGGG ATTCGCGTAT TGCCCGATGG CACGCCAATG
ACCTACGATA TCAACGTGGT TTCTGGCTGG ACCGACTGGG TGACCACCGA CCAAATCATT
GCTGAAAGCT TGAAAGAAAT CGGCATCAAC GCCACTACCC GTACCTACGA CTTTAGCGCT
TGGTTCGATA AAGTTCAAAA GGGCGAGTTT GATATGTCGA TTGGCTGGAG CAACAATGCC
CCAACCCCAC TCCAATTCTA TCGTGGCTTG ATGTCAGGTG AAACCGCCGA TACCCCAATT
GGCGAAGCCA ACGGCGACAA CTGGCACCGC TATGGTAATC CAAAGGTCGA TGAATTGTAC
TCACAATTTG TCAAAACCTC AGACCCAGCC GAACAAAAGA AAATCATGAA CGAAATTCAG
ATGATCTTTG TTCAAGAAGC CCCAGCTATC CCAATTATGC CAAACATCTA TTGGGGTGAA
TACAACACCA AACGCTTTAC CAACTTCCCG AACGAAGAAA ACCCCTATGT CTTGCTTTCA
TCGTTCGCCC AACCCGACCG CTTGATCCTC TTGACCAACA TCAAACCAAA ATAA
 
Protein sequence
MKLRFPALML LVIIFTSMLA ACGDASTLTP QATQAPSTTA PAATATEAPA TGSTDPTAEP 
TAAATTDTGS TSSDGKLFLT VSDQQQSTWV RNFNPFASDN RWPTAAGIYE PMFIYNIATG
KIEPWLATEW AWNSDNTELT FTIRDGVKWS DGEAFSSKDV AYTLNLMKEN EKLQGNGRAA
MRFVDSVAAD GNKVVVKFSE VSTIALYDIG HQMIVPEHIW SKIADPVTFT NETPVATGPF
TEIIRFQDQI WELGKNPNYW QAGKPYIDGI RQPAYPSNDA ANLATINGEN DLASNFIPDV
ENTYVAKDPE NNHYWFPPVN APVMLLLNTT KAPFNDPNVR KAISMGFDRQ QITEIATYSY
NGPSDATGLP ESFADWKNPE AVAAGDWVNF DVEKANAMLD AAGLTRGADG IRVLPDGTPM
TYDINVVSGW TDWVTTDQII AESLKEIGIN ATTRTYDFSA WFDKVQKGEF DMSIGWSNNA
PTPLQFYRGL MSGETADTPI GEANGDNWHR YGNPKVDELY SQFVKTSDPA EQKKIMNEIQ
MIFVQEAPAI PIMPNIYWGE YNTKRFTNFP NEENPYVLLS SFAQPDRLIL LTNIKPK
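
The protein sequence can be reproduced from the gene sequence by stripping the spacing and translating codon by codon. The sketch below is not the database's own tooling; the helper names `clean` and `translate` are hypothetical. It builds the codon table by hand and relies on the fact that translation table 11 assigns internal codons the same amino acids as the standard code (the tables differ in which start codons are allowed, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAGCTAC GGTTTCCTGC GTTGATGTTG"))  # MKLRFPALML
```

Passing the full gene sequence to `translate` in the same way gives a string that can be compared against the listed protein sequence after running the latter through `clean`.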