Gene Jann_1850 details


Gene Information

Locus tag: Jann_1850
Symbol:
ID: 3934301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1836588
End bp: 1837697
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 60%
IMG OID: 637904204
Product: hypothetical protein
Protein accession: YP_509792
Protein GI: 89054341
COG category: [S] Function unknown
COG ID: [COG4260] Putative virion core protein (lumpy skin disease virus)
TIGRFAM ID:
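
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming inclusive 1-based coordinates and a coding sequence that ends in a single stop codon; the function name `lengths_consistent` is hypothetical and not part of any database tool.

```python
def lengths_consistent(start_bp: int, end_bp: int,
                       gene_len_bp: int, protein_len_aa: int) -> bool:
    """Check that coordinates, gene length and protein length agree.

    Assumes inclusive coordinates and one trailing stop codon that is
    counted in the gene length but not in the protein length.
    """
    span = end_bp - start_bp + 1                  # inclusive span in bp
    codons = protein_len_aa + 1                   # amino acids plus stop codon
    return span == gene_len_bp and gene_len_bp == 3 * codons


# Values taken from the record above.
print(lengths_consistent(1836588, 1837697, 1110, 369))  # True
```

Here 1837697 - 1836588 + 1 = 1110 bp, and 1110 / 3 = 370 codons, which is 369 amino acids plus the stop codon.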


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.129812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0309289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATTA TGGATTTTCT CAAAGGTCAA TTTATTGACG TCATCGAATG GACCGATGAT 
AGCCGGGACA CGATGGTCTA TCGCTTCGAG CGCTATGGCC ATGAAATCAA GTACGGTGCC
AAGCTGACGG TGCGCGAAGG CCAGGTTGCG GTCTTCATCC ACGAAGGCCA GCTGGCCGAC
GTCTTCACCC CCGGTCTCTA TATGCTCGAG ACCAACAACA TGCCGATCAT GACGTCCCTT
CAGCATTGGG ACCATGGCTT CAGCTCTCCG TTCAAGTCCG AGATCTACTT CGTGAACACG
TCGCGGTTCA CGGACCTCAA GTGGGGCACG AAAAACCCCA TCATGATCCG CGACAGCGAT
TTCGGCCCCA CGCGCATCCG CGCCTTTGGC ACCTACACCG TGAAGGTGAA GGATGCGGGC
CTGTTCATGA CGGAAATCGT GGGCACGGAC GGCGAGTTCA CCACCGACGA GGTGACGCAC
CAGATCCGCA ACATAATCGT GCAGCAGTTC AGCCAGGCCG TCGCGGGCTC GGGCATTCCG
GTCCTTGATA TGGCGGCGAA TACCGGCCAG ATGGGCGAGG TCGTGGCCGA GAAGATCTCT
GCCACTATCG GCTCCTACGG TCTGACCTTG CCGGAGCTGT ATATCGAAAA CATCTCCCTG
CCGCCCGCGG TGGAAGAGGC GTTGGATAAG CGGACATCCA TGGGTGTTGT GGGTGACCTG
AACAAATACA CCCAGTTCCA GACGGCAGAG GCGATGCGCG CGGCCGCCGA AAACCCCGGC
GGCGGTGGCG GCATGGGCGA AGGTCTTGGC ATGGGTATGG GTATGGCGAT GGCCAACCAG
ATGGCCAATA ACATGCATCA ACCGCACCAG GCCGCGCACG CAGCCCCTCC GCCGCCCCCG
GTGGAGCATG TCTGGCACAC GGCCGAGAAC GGGGCCACGA AAGGCCCGTT CTCCAAGGCG
TCGCTCGGTC AGATGGCAAA CGACGGCTCC CTCACCCGTG ATACGATGGT CTGGACCGCA
GGCCAGGACG GTTGGAAAAA GGCCGGTGAT GTGGATGAGC TGGCGCAGCT GTTTACCGTC
ATGCCGCCCC CTCCACCGCC GCCGATGTAA
 
Protein sequence
MAIMDFLKGQ FIDVIEWTDD SRDTMVYRFE RYGHEIKYGA KLTVREGQVA VFIHEGQLAD 
VFTPGLYMLE TNNMPIMTSL QHWDHGFSSP FKSEIYFVNT SRFTDLKWGT KNPIMIRDSD
FGPTRIRAFG TYTVKVKDAG LFMTEIVGTD GEFTTDEVTH QIRNIIVQQF SQAVAGSGIP
VLDMAANTGQ MGEVVAEKIS ATIGSYGLTL PELYIENISL PPAVEEALDK RTSMGVVGDL
NKYTQFQTAE AMRAAAENPG GGGGMGEGLG MGMGMAMANQ MANNMHQPHQ AAHAAPPPPP
VEHVWHTAEN GATKGPFSKA SLGQMANDGS LTRDTMVWTA GQDGWKKAGD VDELAQLFTV
MPPPPPPPM
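
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is an illustrative fragment rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the block spaces and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Illustrative fragment only: the first two blocks of the gene sequence.
fragment = clean_sequence("ATGGCCATTA TGGATTTTCT")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` can be compared with the reported GC content.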