Gene Hneap_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1103 
Symbol 
ID8534251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1192748 
End bp1194049 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content54% 
IMG OID646383488 
Producturea ABC transporter, urea binding protein 
Protein accessionYP_003262986 
Protein GI261855703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC GTAACTTTAT GAAAGCCGTG GGCGCTACCG GATTGTCACT TAGTTTCGGC 
CTCAAGGGCA TCGAGTTTGC CAATGCCGCA GAAGGCCCAA TCAAGGTCGG TATTCTGCAT
TCGTTATCCG GCACGATGGC CATCTCTGAG AGCGCACTCA AAGACAACAT GCTGATGTTG
ATTGCCGAGC AGAACGCCAA AGGTGGTGTG ATGGGGCGCA AATTAGAGGC AGTCGTCGTC
GATCCCGCAT CCAATTGGCC CCTGTTCGCG GAAAAGGCAC GCGAACTGAT CAGCAAGGAC
AAAGTTTCCG CCATTTTCGG ATGCTGGACT TCCGTTTCCC GCAAATCCGT CCTGCCTGTC
GTCGAAGAAC TCAATGGCCT GCTGTTCTAT CCCGTTCAGT TTGAAGGCGA AGAATCCTCG
CGCAATATTT TCTACACCGG CGCGGCGCCA AACCAGCAAG CCATTCCGGC CGTTGATTAT
TTGATGAACG AGCTTGGCAT TACCCGTTGG GTATTGGCGG GCACCGATTA CGTGTATCCG
CGCACCACCA ACAAAATCCT GGAAGCCTAC CTCAAGCAAA AAGGTGTGAA GGACGAAGAC
ATCATGATCA ACTACACGCC GTTCGGTCAG TCCGACTGGC AGTCGATCGT GAGCCAGATC
AAGCAATTCG GCAGCGCGGG CAAACCCACC GCAGTTGTTT CCACCATCAA CGGCGATGCC
AACGTGCCGT TCTATCGCGA ACTGGGCAAT CAGGGCATTC AGTCTCAGGA TATTCCCGTC
GTTGCCTTCT CCGTGGGCGA ACAGGAACTC TCCGGCATGG ATACCAAACC ATTGGTCGGC
CAACTGGCGG CCTGGAATTA CTTCGAAAGC GTCAAAACGC CAGAAAATGA AGCCTATATC
GCCAACTGGA AGAAGTTCAA GAAAGACCCT AAAGCGGTGA CCAACGACCC GATGGAAGCG
GAGTACATTG CCTTCCAGAT GTGGGTGAAG GCGGTTGAGA AGGCCAAATC CACCGACACG
GACAAGATCC TCGAATCGAT CATCGGCGTG GAAGTGCCCA ATCTGACCGG CGGCACGGCC
AAAATGCTGC CCAATCACTA CATCACCAAG CCGGTGTACA TCGGGGAAAT TCAGGATGAT
GGACAGTTTG ATGTCGTCTG GAAAACCAAG ACCGAAGTGC CGGGCAAGGC ATGGTCCCCG
TACCTGCCCG GCAGCAAGGA TCTGATTGCG GACTGGACAC CGCCGATCAA CTGCGGCGCG
TACAACACCG TCACCAAGAA GTGCACAGGA TCAGGTTCAT AG
 
Protein sequence
MNRRNFMKAV GATGLSLSFG LKGIEFANAA EGPIKVGILH SLSGTMAISE SALKDNMLML 
IAEQNAKGGV MGRKLEAVVV DPASNWPLFA EKARELISKD KVSAIFGCWT SVSRKSVLPV
VEELNGLLFY PVQFEGEESS RNIFYTGAAP NQQAIPAVDY LMNELGITRW VLAGTDYVYP
RTTNKILEAY LKQKGVKDED IMINYTPFGQ SDWQSIVSQI KQFGSAGKPT AVVSTINGDA
NVPFYRELGN QGIQSQDIPV VAFSVGEQEL SGMDTKPLVG QLAAWNYFES VKTPENEAYI
ANWKKFKKDP KAVTNDPMEA EYIAFQMWVK AVEKAKSTDT DKILESIIGV EVPNLTGGTA
KMLPNHYITK PVYIGEIQDD GQFDVVWKTK TEVPGKAWSP YLPGSKDLIA DWTPPINCGA
YNTVTKKCTG SGS