Gene Jann_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_0221 
Symbol 
ID3932658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp216104 
End bp217804 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content61% 
IMG OID637902563 
Productglycosyl transferase family protein 
Protein accessionYP_508163 
Protein GI89052712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.780402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC TTGGGTTCGT GATGATGTGC CATACCGCGC TGGACCGTGC GGCAGAAGTG 
GCGCGCCATT GGGCGGAACG CGATTGTCCC GTGGTGATCC ACCTCGATAA ACGCGTCGCG
AAAGAGCGGG ACGTTTGGCT GAAGCGAGAG CTCTCGGACC TGGACAACGT CAAATTCTGC
AAGCGCCACA AGTGCGAATG GGGCACGTGG AGCCTTGTGC AGGCGAGCCA GACCGGCGCC
GAGATGATGT TGGAAGAATT TCCCGGCGTG CGGCATGTCT ATCTGGCGTC CGGGTCCTGC
CTGCCCCTGC GCCAGGTGGA TGAGTTGCGC GCTTATCTGG CCGCGCGACC GATGACGGAT
TTCATCGAAA GCGTTACCAT CCGCGATGTG GATTGGACTG TGGACGGCCT CAACACCGAA
CGCTTTCAGT TCCGGTTTCC GTTCTCCTGG AAGCGGCACA GGTTCCTGTT TGATCGCTAC
GTGGAGCTTC AGCGCAGCAC AGGCTTCAAA CGCAAACTGC CTGATGGGTT GCAACCGCAT
CTGGGCAGCC AATGGTGGTG CCTGTCACGA CAGACCTTGT CGGCCATTTT GCAAGATCCG
GCCCGGCGGG AGTTTGACCG TTACTTCAAG CTGGTCTGGA TCCCGGACGA ATCCTATTAT
CAGACCCTCG TGCGCAATTA CTCCCGCAAC ATCGAAAGCC GGTCGCTGAC CCTGTCGAAG
TTTGATTTCC AGGGCAAACC GCACGTGTTC TATGACGATC ATCTGCATTT GCTGCGCCGG
TCCGATTGCT TTGTGGCGCG CAAGATCTGG CCCCGCGCCG ACCGGCTTTA CCGCACGTTC
CTGAACGATG ACGCGGCGGA TGCACGCAAG GCGGAGCCGA CACCTGGCAA GATCGACCGC
GTTTTCGCCA AGGCGCTGGA GCGTCGGACA CGCGGGCGGC CCGGCCTGTA TATGCAGTCG
CGGTTTCCCA ACAATGACTG GGAGAATGGG AAGACCGCCG CGCCATATTC CGTGCTCCAC
GGCTTCAATG AGTTGTTTCT GGACTTTGAG CCATGGCTGT CGCGACGGCT TGGCACCTCT
GTGCATGGCA ACATCTTCAG TCCCAAGCGG GTGCGCTTTG CGGGGGATGC GGATACGTTT
GCCGGGTGCC TCTCCGCCGC GCCGGAATTG CGGGACTATA ACCCGGAGCG GTTCCTCACC
AGTCTGATCT GGAACACCCG CGGCGAACGT CAGGTGTTCA TGTTCGGGCC GGAAGACAAC
CAACGGCCCG GGGCATTCAT GGCGGCTGAC AGCAATGCCC AGATCAGTGT CATTACCGGC
GCGTGGGCGG TGCGTCTGTT CACCGCCAAC CGCAATTTCG GCGATATCCG CACGGAGGCC
GCGCGCCTAC AGCGGCGTGA GGTCGAGTTC GTGAACACGC TGCGCCACGT CCGCTCCCGC
GCTGAAATCC GGATCTGGTC CCTGGCGGAG TTTCTTGAAG AGCCGATGGA AAACCTACAA
CGGATTCTGG ATGCGATGGA GGGCGCGCAG GCCAGCCGCC TGACCGAAGC GCCGAAGATG
GCCAATCTGG CGGGTTTGCC GAAATTCCTT CAAAACCTGA AAAACCAGGG AATGAACCCC
TTTGCCGTGG GTGATTTCCC GCAAGAAGGC GTCGCGCCGC TTCCTGCCGG TGCCGACAAC
CGCCCATATC TGGTGAGATA G
 
Protein sequence
MSTLGFVMMC HTALDRAAEV ARHWAERDCP VVIHLDKRVA KERDVWLKRE LSDLDNVKFC 
KRHKCEWGTW SLVQASQTGA EMMLEEFPGV RHVYLASGSC LPLRQVDELR AYLAARPMTD
FIESVTIRDV DWTVDGLNTE RFQFRFPFSW KRHRFLFDRY VELQRSTGFK RKLPDGLQPH
LGSQWWCLSR QTLSAILQDP ARREFDRYFK LVWIPDESYY QTLVRNYSRN IESRSLTLSK
FDFQGKPHVF YDDHLHLLRR SDCFVARKIW PRADRLYRTF LNDDAADARK AEPTPGKIDR
VFAKALERRT RGRPGLYMQS RFPNNDWENG KTAAPYSVLH GFNELFLDFE PWLSRRLGTS
VHGNIFSPKR VRFAGDADTF AGCLSAAPEL RDYNPERFLT SLIWNTRGER QVFMFGPEDN
QRPGAFMAAD SNAQISVITG AWAVRLFTAN RNFGDIRTEA ARLQRREVEF VNTLRHVRSR
AEIRIWSLAE FLEEPMENLQ RILDAMEGAQ ASRLTEAPKM ANLAGLPKFL QNLKNQGMNP
FAVGDFPQEG VAPLPAGADN RPYLVR