Gene Hneap_0363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0363 
Symbol 
ID8533484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp370195 
End bp371868 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content60% 
IMG OID646382747 
Productprotein of unknown function DUF637 hemagglutinin putative 
Protein accessionYP_003262273 
Protein GI261854990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAATCG AACCAAGGAT CGATCTGCTG CCCGGGTTGC CATCCGAAGC AAAACCCGGC 
AAGCCCTCGA ATCGAACGAT GGGGGCGCTC CTCACGATCG GGCTGGTCTT CGCCTGGTTC
TCCAGCCAGG GCCACATGAC CGCCGGGCAC ATCCAGGCCA ACCAAGGCGA CCTCACCCTC
GCAGCCGTTC AGGCCAAGGC CACCGGCACT TCTGAACCAT CGGACGGATC GGACGGGCCG
GTTCATTCGC CTGGGCAAAT AAGCCTCAAG GCCGCAGGCA ATATCAATCT CGCCAGCGTC
AGTACCGAAA GCTACCAGCG GACCGATGAG AAGCATAAAG ATAAAGCCTG GCAGGAAACC
CACGGTGAAG GCAATTACGA TCAGCAAACC CACTACAACC AACTAACCGC CGGACAGCTC
GATCTTCAGG CCGGTGGCAG CATCACCGCC GACATGAGCG TGCGTGACAG CGCCGCCATG
CTGGCCCAGT CACCCGACAT GGCCTGGCTG CGCCAGTTGC AACAGAATCC GAAACTGGTC
GGCAAGGTCG ATTGGCAACA GATCGAAGAA GCCCATCAAC ATTGGGACTA TAAACACCAG
GGCCTGACCC CGGCGGCATC CGCCGTCGTG GCCCTGGTTG TTGCGTACTT CACGATGGGT
GCCGGTTCGG CCATCGTCAA TACGGCTGCT GGATCCACTA CGGCTGCAGC CAGTGGCGCC
GGTGCCGTCG CGGCAGGCAT GACCCAGGCT GCGGTCAGCA CCATGGCCAG CCAAGCGGCC
GTCAGCTTCA TCAATAACGG TGGTGACCTC AGCAAGACCC TGAACGATCT GGGCAGCAGC
CAGAGCATGC GCCAACTGGC CACAGCAGTT GTCACCGCCG GGGTGCTTAG CAGTATTGGT
CAAGTCACCT TCGGCGAAGG CAAGAATGCC TTCCGGCTGA ACGATGTCAA GGTAAGCGAT
GGCCTGGTAC CGAACATCGG CAAAAACCTG ATCGACGGCG TTGCCCGAGC CACCGTCAAC
AGCGCCATCA CCGGCACCGA CCTTCAAACC AATATCCGCA CCAATGTGGT GGCTGGCATC
CTGGGTGCCG CCGAACAACA AGGTGCTAAT TGGATCGGCA ACCAGACCCT GCTGGGCGGG
GACTTCAACA CCAACGGCAA CGTCAACGAA TTCGCCCATG AATTCGCCCA TGCCATCGTC
GGTTGTGCCG CCGGAGTGGC CGGTGCCAGT GCATCGGGCA GTGGTGCCAG TACCGGTCAA
GGTTGTAGTG CTGGAGCCTT GGGTGCCGTG GTGGGTGAAC TATCCGCCCA ATTCTATGGC
GGTACCGATC CGAACCAGAC CATCGCCTTC GCCCAGATGA TGGGCGGCAT CGCCGCTGCT
GCGGCGGGGC TTGGTTCCGA AGGCGTTGCC ATCGCCGCCA ATACCGGTGC CAATGCGGCG
CAGAACAACT ACATGGCGCA TTACGACACG TATGAAGCGG ATCTGAAGGA CTGTCAGCAG
AATCCGGGCG GTGTGAACTG CGGTGCCATC TTAAGTCTGA CCGAACCCAC ATCAGTCCAA
ACCCACCAGA CCCTTTCCCC GCTGGCTCAA TGTGCGCAAG CACACCGCCT GAGCGGTCGG
CACGAGCGAA GCGATTTGCC GAGTAGGGAT TTTGCCTCAT CATTCGATAT TTAA
 
Protein sequence
MLIEPRIDLL PGLPSEAKPG KPSNRTMGAL LTIGLVFAWF SSQGHMTAGH IQANQGDLTL 
AAVQAKATGT SEPSDGSDGP VHSPGQISLK AAGNINLASV STESYQRTDE KHKDKAWQET
HGEGNYDQQT HYNQLTAGQL DLQAGGSITA DMSVRDSAAM LAQSPDMAWL RQLQQNPKLV
GKVDWQQIEE AHQHWDYKHQ GLTPAASAVV ALVVAYFTMG AGSAIVNTAA GSTTAAASGA
GAVAAGMTQA AVSTMASQAA VSFINNGGDL SKTLNDLGSS QSMRQLATAV VTAGVLSSIG
QVTFGEGKNA FRLNDVKVSD GLVPNIGKNL IDGVARATVN SAITGTDLQT NIRTNVVAGI
LGAAEQQGAN WIGNQTLLGG DFNTNGNVNE FAHEFAHAIV GCAAGVAGAS ASGSGASTGQ
GCSAGALGAV VGELSAQFYG GTDPNQTIAF AQMMGGIAAA AAGLGSEGVA IAANTGANAA
QNNYMAHYDT YEADLKDCQQ NPGGVNCGAI LSLTEPTSVQ THQTLSPLAQ CAQAHRLSGR
HERSDLPSRD FASSFDI