Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0363 |
Symbol | |
ID | 8533484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 370195 |
End bp | 371868 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646382747 |
Product | protein of unknown function DUF637 hemagglutinin putative |
Protein accession | YP_003262273 |
Protein GI | 261854990 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAATCG AACCAAGGAT CGATCTGCTG CCCGGGTTGC CATCCGAAGC AAAACCCGGC AAGCCCTCGA ATCGAACGAT GGGGGCGCTC CTCACGATCG GGCTGGTCTT CGCCTGGTTC TCCAGCCAGG GCCACATGAC CGCCGGGCAC ATCCAGGCCA ACCAAGGCGA CCTCACCCTC GCAGCCGTTC AGGCCAAGGC CACCGGCACT TCTGAACCAT CGGACGGATC GGACGGGCCG GTTCATTCGC CTGGGCAAAT AAGCCTCAAG GCCGCAGGCA ATATCAATCT CGCCAGCGTC AGTACCGAAA GCTACCAGCG GACCGATGAG AAGCATAAAG ATAAAGCCTG GCAGGAAACC CACGGTGAAG GCAATTACGA TCAGCAAACC CACTACAACC AACTAACCGC CGGACAGCTC GATCTTCAGG CCGGTGGCAG CATCACCGCC GACATGAGCG TGCGTGACAG CGCCGCCATG CTGGCCCAGT CACCCGACAT GGCCTGGCTG CGCCAGTTGC AACAGAATCC GAAACTGGTC GGCAAGGTCG ATTGGCAACA GATCGAAGAA GCCCATCAAC ATTGGGACTA TAAACACCAG GGCCTGACCC CGGCGGCATC CGCCGTCGTG GCCCTGGTTG TTGCGTACTT CACGATGGGT GCCGGTTCGG CCATCGTCAA TACGGCTGCT GGATCCACTA CGGCTGCAGC CAGTGGCGCC GGTGCCGTCG CGGCAGGCAT GACCCAGGCT GCGGTCAGCA CCATGGCCAG CCAAGCGGCC GTCAGCTTCA TCAATAACGG TGGTGACCTC AGCAAGACCC TGAACGATCT GGGCAGCAGC CAGAGCATGC GCCAACTGGC CACAGCAGTT GTCACCGCCG GGGTGCTTAG CAGTATTGGT CAAGTCACCT TCGGCGAAGG CAAGAATGCC TTCCGGCTGA ACGATGTCAA GGTAAGCGAT GGCCTGGTAC CGAACATCGG CAAAAACCTG ATCGACGGCG TTGCCCGAGC CACCGTCAAC AGCGCCATCA CCGGCACCGA CCTTCAAACC AATATCCGCA CCAATGTGGT GGCTGGCATC CTGGGTGCCG CCGAACAACA AGGTGCTAAT TGGATCGGCA ACCAGACCCT GCTGGGCGGG GACTTCAACA CCAACGGCAA CGTCAACGAA TTCGCCCATG AATTCGCCCA TGCCATCGTC GGTTGTGCCG CCGGAGTGGC CGGTGCCAGT GCATCGGGCA GTGGTGCCAG TACCGGTCAA GGTTGTAGTG CTGGAGCCTT GGGTGCCGTG GTGGGTGAAC TATCCGCCCA ATTCTATGGC GGTACCGATC CGAACCAGAC CATCGCCTTC GCCCAGATGA TGGGCGGCAT CGCCGCTGCT GCGGCGGGGC TTGGTTCCGA AGGCGTTGCC ATCGCCGCCA ATACCGGTGC CAATGCGGCG CAGAACAACT ACATGGCGCA TTACGACACG TATGAAGCGG ATCTGAAGGA CTGTCAGCAG AATCCGGGCG GTGTGAACTG CGGTGCCATC TTAAGTCTGA CCGAACCCAC ATCAGTCCAA ACCCACCAGA CCCTTTCCCC GCTGGCTCAA TGTGCGCAAG CACACCGCCT GAGCGGTCGG CACGAGCGAA GCGATTTGCC GAGTAGGGAT TTTGCCTCAT CATTCGATAT TTAA
|
Protein sequence | MLIEPRIDLL PGLPSEAKPG KPSNRTMGAL LTIGLVFAWF SSQGHMTAGH IQANQGDLTL AAVQAKATGT SEPSDGSDGP VHSPGQISLK AAGNINLASV STESYQRTDE KHKDKAWQET HGEGNYDQQT HYNQLTAGQL DLQAGGSITA DMSVRDSAAM LAQSPDMAWL RQLQQNPKLV GKVDWQQIEE AHQHWDYKHQ GLTPAASAVV ALVVAYFTMG AGSAIVNTAA GSTTAAASGA GAVAAGMTQA AVSTMASQAA VSFINNGGDL SKTLNDLGSS QSMRQLATAV VTAGVLSSIG QVTFGEGKNA FRLNDVKVSD GLVPNIGKNL IDGVARATVN SAITGTDLQT NIRTNVVAGI LGAAEQQGAN WIGNQTLLGG DFNTNGNVNE FAHEFAHAIV GCAAGVAGAS ASGSGASTGQ GCSAGALGAV VGELSAQFYG GTDPNQTIAF AQMMGGIAAA AAGLGSEGVA IAANTGANAA QNNYMAHYDT YEADLKDCQQ NPGGVNCGAI LSLTEPTSVQ THQTLSPLAQ CAQAHRLSGR HERSDLPSRD FASSFDI
|
| |