Gene Jann_3830 details

Gene Information

Locus tag: Jann_3830
Symbol:
ID: 3936310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3923441
End bp: 3925339
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 62%
IMG OID: 637906208
Product: Beta-galactosidase
Protein accession: YP_511772
Protein GI: 89056321
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
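
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3923441
end_bp = 3925339
gene_length_bp = 1899
protein_length_aa = 632

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bp.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # prints: 1899 633 0 632
```

Under these assumptions the span matches the stated gene length, and 633 codons minus the stop codon gives the stated 632 aa.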


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.247674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCC AGCGGACGCT TGGCACCTGT TACTACCCCG AGCATTGGCC TGAGAAGATC 
TGGGCGGAGG ATGCTGCGCG CATGGCCGTC CTTGGGCTGA CCTGGGTCCG TATTGGAGAG
TTCGCCTGGA GCCGGTTGGA ACCCTCCCCC GGCGATTTGC AGTTCGACTG GCTCGACCGT
GCGATTGACG TGCTGCATGC GGCAGGCCTC AAGGTTGTGC TTGGCACCCC GACCGCCACG
CCGCCCCGTT GGATGGTCGC GCGGCACCCC GATATGCTTG CGGTTGATGA CCAGGGTAGG
CCTCGAAAAT TCGGCTCCCG CCGCCATTAC TGTTTTTCCC ATGACGGCTA CCGCGAAGAA
GGTGTCCGGA TCACTGGCCT TCTGGCCGAG CGGTTTGGGC GCAAGGTGGA CGCATGGCAG
ACCGATAACG AATATGGCTG CCATGACACG GTCTATAGTT ACTCGGACGC CGCCCAAACT
GCGTTCCGGG AGTGGCTGGC GCAGAAATAT CAATCCCCAG ATGCGCTGAA CCGCGCTTGG
GGCAATGTCT TCTGGTCGAT GGACTATGAC AGTTTTGACG ATATCGACTT GCCGAACCTG
TCGGTGGCAG AGCCCAACCC GTCCCACGCG CTGGACTTTC GCCGCTTCTC GTCCGACCAG
GTCGCCCGCT TCAATGGCGC GCAAGTCGCC GCGATCCGTG CCCATTCCGA CGTGCCAATC
AGCCACAATT ACATGGGTCG GATCGTCGAA TTCGATCACT TCGCCACCGG ACGCCAGATG
GAAATCGCGA CGTGGGACAG CTACCCGTTG GGCTTTCTGG AGGACCGCCT GGAAAGCTCG
CCTGAACACA AGCGCCGTTA TGCCCGCCAG GGTGATCCGG ATTTTCAGGC CTTCCACCAC
GATCTTTACC GGGCTGTCGG ACAAAATGGC CGCTGGTGGG TGATGGAGCA ACAGCCCGGC
CCCGTGAACT GGGCCCCTCA CAATCCCGCG CCGCTGCCTG GCATGGTGCG CTTCTGGACG
TGGGAGGCCT TCGCCCACGG CGCCGAATGT ATCGCGTATT TCCGCTGGCG GCAGGCCCCT
TTCGCGCAGG AGCAATACCA TGCGGGCCTT CTCCGCCCCG ACAATGTTGA AGCGGAGGGT
TTTGCCGAGG CCGCGCAGGT CGCCCGCGAA TTGCAGGACA TGCCAGCGGT CGAACACACC
CAAGCCCCGG TGGCCCTCGT GTTCGACTAC GCCAGTGCCT GGGGCTGGAG CGTTCAGCCG
CAAGGCGCAG GCTGCGACTA CTTTCGTTTG GTGTTTGAGG TGTATCGCGG CCTGCGCAAA
CTGGGTCTTT CTGTCGATAT CCTTCCGCCG GATGCCAAGA CGCTGAACGG CTACGCCCTC
GTCCTCGCCC CCGGCGTTCT CACCCTGTCC GACGCGCTGA AGGCCGCCCT ATCCGAGACG
ACAGCCCAGG TCCTTCTCGG ACCGCACACA AATGCAAAGA CACCGGCGCT TGGCATCCCG
GTCCCGTTAC CTCCTGCGAT TTCAGGGCTG TCTGCCATCG TCGCCCGTGT GGAAACCCTC
CGCAACGATA TGCCGATCCC GGCCGAAAAT GGCGGCTCTG TGAAGCTGTG GTTTGAGCAT
CTTGAGACGA CGCATGATGC GGTTGAGAAA ACAGAAGCAG GCGCGCCCAT TCTCGTTCAA
TCCGCCAACC TTTCGTATCT TGCGGGTTGG CCCGATGAGA TGCTGCTGGA GCGGGTTCTG
ACCCGGGCTT CCTCTGCCGC CGGTCTCTCC CCCGAACCCT TGCCCCAGGA CATCCGTATC
CGTGACACCG GCACCACGCG GTTTGTGTTC AATCACGGCC CCGATGCCGT CGCCTACAAT
GGCCAAAGCA TCCCACCAGC TGGTGTTACT TGGGACTGA
 
Protein sequence
MTLQRTLGTC YYPEHWPEKI WAEDAARMAV LGLTWVRIGE FAWSRLEPSP GDLQFDWLDR 
AIDVLHAAGL KVVLGTPTAT PPRWMVARHP DMLAVDDQGR PRKFGSRRHY CFSHDGYREE
GVRITGLLAE RFGRKVDAWQ TDNEYGCHDT VYSYSDAAQT AFREWLAQKY QSPDALNRAW
GNVFWSMDYD SFDDIDLPNL SVAEPNPSHA LDFRRFSSDQ VARFNGAQVA AIRAHSDVPI
SHNYMGRIVE FDHFATGRQM EIATWDSYPL GFLEDRLESS PEHKRRYARQ GDPDFQAFHH
DLYRAVGQNG RWWVMEQQPG PVNWAPHNPA PLPGMVRFWT WEAFAHGAEC IAYFRWRQAP
FAQEQYHAGL LRPDNVEAEG FAEAAQVARE LQDMPAVEHT QAPVALVFDY ASAWGWSVQP
QGAGCDYFRL VFEVYRGLRK LGLSVDILPP DAKTLNGYAL VLAPGVLTLS DALKAALSET
TAQVLLGPHT NAKTPALGIP VPLPPAISGL SAIVARVETL RNDMPIPAEN GGSVKLWFEH
LETTHDAVEK TEAGAPILVQ SANLSYLAGW PDEMLLERVL TRASSAAGLS PEPLPQDIRI
RDTGTTRFVF NHGPDAVAYN GQSIPPAGVT WD
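
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch rather than the tool that produced the record: `CODON_TABLE` and `translate_cds` are hypothetical names, and the codon-to-residue mapping is the standard one, which translation table 11 shares for internal codons (table 11 differs only in permitting alternative start codons, and this gene starts with ATG).

```python
from itertools import product

# Standard codon-to-residue mapping in the conventional TCAG ordering.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks.

    A trailing stop codon is dropped from the result. Alternative start
    codons are not handled specially (hypothetical helper, ATG start assumed).
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # remove the stop codon symbol
    return "".join(protein)


# First 60 bases and last 39 bases of the gene sequence above.
first_block = "ATGACCCTCC AGCGGACGCT TGGCACCTGT TACTACCCCG AGCATTGGCC TGAGAAGATC"
last_block = "GGCCAAAGCA TCCCACCAGC TGGTGTTACT TGGGACTGA"

print(translate_cds(first_block))  # prints: MTLQRTLGTCYYPEHWPEKI
print(translate_cds(last_block))   # prints: GQSIPPAGVTWD
```

The two outputs correspond to the first 20 and last 12 residues of the protein sequence listed above.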