Gene Jann_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4041 
Symbol 
ID3936529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4143307 
End bp4145061 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content67% 
IMG OID637906426 
Productheparinase II/III-like 
Protein accessionYP_511983 
Protein GI89056532 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC CCCTTCCCCC GTCCCCCGCA AGGCCGCGCC CGCGTGACGG GCTGGCCGCG 
CGCATCCGCC GTGAGTGGGC GGCGCGTAGG GCGGGGTTGG GCCCCCGGGC GCAGGGGTTC
CTGTGGCAAC CGGAACCCCG GTTTCCCGGG TCTGCGGTCC GGGGGCGGCA ATTGTTGGCA
GGCAATTTCC GCCTGGGCGG GGCATTGGTT GAGATTGACG GGATCAGCCC CTGGGACATC
ATTCCGCCCA ACGACGAGTT TGAGGCTGCG CTGCACGGAT TTGCCTGGCT CGACGATCTG
GTGGCCGTCC CCAACAACGA GGGCCGCGCC ATGGCACAGC GATGGCTGGC CGAGTGGACG
ACCCGCTATG GCAAGGGTCG GGGACCGGGG TGGAGCGCGG ATCTGACAGG CCGCAGGCAG
ATCCGCTGGA TCACCCACAC CCTGTTTTTG ATGAACGGCC AGGCCCCGGC GGACGGCAGG
TTGTTCCACC TCGCGCTGTC GCGGCAAGCC AACTACCTTG CCAGACACTG GCGGCGGGCA
TCACCCGGTT TGGCGCGGTT CGAAGCGCTG ACCGGTCTGA TTTACTCTGC CTGCGCACTG
ATCGGGATGG AGACGCGGCT TGAGCCCGCG CTGACGGGCC TTGCGCAGGA TTGCGCAACA
CAGATCGACG CCCAGGGCGG CATCGTCACG CGCAACCCGG AAGAATTGCT GGAGGTGTTC
GTGTTGCTGA CCTGGATCGC CCAGATCCTG CAGGAAACCG GCAAACGCGC CGACCCGGCA
GTGGATACCG CGATCATGCG CGTGGCCCCA ACCCTGAGGG CGTTGCGCCA TGCCGACGGC
AGTCTTGCGC GGTTCCACGG CGGCGGTCGC GGCGCGCCGG GGCGGCTGAT CGGGGCGTTG
GTGCAATCCG GCGTGCGACC ATCGCGGGTG CGGGGGCTGG CAATGGGCTA TGCGCGTATG
GCGTCGGGCC GGGTGACGAT AATCACCGAC GCGGCCCCGC CGATGATCGG CACGGGCTCC
ACCAATGCCC ATGCGGGCAC GCTGGCGTTT GAGATGTGTT CGGCCAATCA CCCGCTGATC
GTCAATGCCG GGTCCGGTGC CAGTTTTGGG CCGGAATGGC GGCGCGCGGG TCGCGCCACG
GTCAGCCATT CCACCGTGTC TCTGGAAGGA TATTCGTCAT CGCGCTTTGC CGAGAAGGCG
CTGCATGATC CCCCCGAACG CCAGAGTTTT GAGAACGGCC CCAGCGACGT GTCGGTGCAG
GTGTCCGAGA TCACCGGCGG TGAAGGGCTG GTGTTGTCCC ACGATGGCTG GCGCAAGACC
CACGGGCTGG TGCATCTGCG CTCTCTCACG CTGGAGGATA ACGGCAATCT TCTGCGCGGC
GAAGATGGCC TCGCCGCATT GGATGGCCAT GACCGGGATC GCTTCATGCG CGTCAATCGC
AGCTTGCCCT CTGACGTCGG TCTGCGCTTT GCCGCGCGCT TCCATCTGCA TCCCGATGTG
GTGGTGGAGT TGGATATGGG CGGGGCCGCG ATTTCGCTGA CATTGCCCAC CGATGAGGTC
TGGGTGTTCC GCCACGGCGA CGAGGCCGAG CTGTCGATCC GCCCTTCCGT CTATTTCGAT
GCAACGCGCC TGAAACCCCG CGCGACAAAA CAGATTGTTT TAACCTCCCG CGTCAGGGGG
TATGGAGCGG CAGTCAGCTG GTCCATCGCG CGCCCCTCGG CGCTGTTGCC CGCCCCCGAC
GACCTGTCTT TGTGA
 
Protein sequence
MSEPLPPSPA RPRPRDGLAA RIRREWAARR AGLGPRAQGF LWQPEPRFPG SAVRGRQLLA 
GNFRLGGALV EIDGISPWDI IPPNDEFEAA LHGFAWLDDL VAVPNNEGRA MAQRWLAEWT
TRYGKGRGPG WSADLTGRRQ IRWITHTLFL MNGQAPADGR LFHLALSRQA NYLARHWRRA
SPGLARFEAL TGLIYSACAL IGMETRLEPA LTGLAQDCAT QIDAQGGIVT RNPEELLEVF
VLLTWIAQIL QETGKRADPA VDTAIMRVAP TLRALRHADG SLARFHGGGR GAPGRLIGAL
VQSGVRPSRV RGLAMGYARM ASGRVTIITD AAPPMIGTGS TNAHAGTLAF EMCSANHPLI
VNAGSGASFG PEWRRAGRAT VSHSTVSLEG YSSSRFAEKA LHDPPERQSF ENGPSDVSVQ
VSEITGGEGL VLSHDGWRKT HGLVHLRSLT LEDNGNLLRG EDGLAALDGH DRDRFMRVNR
SLPSDVGLRF AARFHLHPDV VVELDMGGAA ISLTLPTDEV WVFRHGDEAE LSIRPSVYFD
ATRLKPRATK QIVLTSRVRG YGAAVSWSIA RPSALLPAPD DLSL