Gene Jann_2003 details


Gene Information

Locus tag: Jann_2003
Symbol:
ID: 3934456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 2005022
End bp: 2007007
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 62%
IMG OID: 637904359
Product: glycosyl hydrolase, BNR protein
Protein accession: YP_509945
Protein GI: 89054494
COG category: [R] General function prediction only
COG ID: [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor
TIGRFAM ID:
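
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2005022, 2007007)
print(length, protein_length(length))  # 1986 661
```

Under these assumptions the record's values of 1986 bp and 661 aa follow directly from the start and end coordinates.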


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.075614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.402986
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGC CCGCCGATAT ACATCCCAAG CCAGAACCCG CCACCTCGCC CGCGCCCCCG 
GTGTCGCCGC CCGCCGATAT TCCGTTCCCC TTCGCAACAC TCGTCGAGAA CCAGCGCAGC
CGAGCTCACA ATGCCACGCG CCGCGCCCAG GGCATCTTTG CCGCACTAAT CGCCGTTATC
CTACTAGGTA TCGGGTACTA CGTAGGCCTG CCGTTGTACC TCCAATGGAA CGAGTCGTCA
CGCAGCGTCT ACGAAGCTGA GGCGGGTACG CTGTCCTCGC AGTTGGACCG GATGGATGAG
GCAAGGGACC GCATATGGAC AGCTCTGTCC GAAGAGCTGA AGGGAATCGG AACTCCTCGC
CCCTCTGGCG TCGACGTAAA CCTGAATGAC GTCCGTAGTC TGCCCGACGG TACTATCTTG
GTCGCCGTGG GGCGGCACGG CACCGTCATA CGGTCCACCG ATGCCGGCAA CACTTGGACA
CCTCGTCGCT CCGGCGTGGA TATTGATCTA TTCAACCTCC GTGTTCTGCC CGACAGCACC
ACCCTGGTCG CTGTGGGAGA GGGCGGCACC GTCATACGGT CCACTGATGC TGGCAACACT
TGGCAGTCCC GCCCTTCCGG CGTCAATAAC AGTCTGTACG ACTTACGCGT GCTACCCAAC
AGCACCACCC TAATCGCTGT TGGACGAAGC GGTGCAGTCA TACGGTCCAC TGATGCTGGC
AACACTTGGA TGCCCCGCCC CTCCGGCGTG GATGTTGATC TATACAACCT TGGCATTCTG
GCGGACGGCG CCACCTTGGT CGCTGTAGGC GAAGACGGCA CTGTCATACG CTCGACCGAT
GCCGGCGAAA CTTGGACATT CCTTCCCTCC GTCGTGGATG CCGATCTGTT CAACCTCGGC
ATTCTGCCCG ACAGCACCAC CTTAGTCGGG GTAGGAGAGA GCGGCACCGT CATAAGGTCT
ACCGATACCG ATAGCACCTG GATGCCCCGC CCCTCCGGCG TCATTGCGGA TTTGTATGTC
CCTCGCGTGC TGCCCGACGG CGTTACTCTG ATCGCGGTGG GATCAAGTGG CGGAATCATA
CGCTCGACCG ATGCCGGCAT GACATGGACG CCCCGCCCCT CCGGCATCGA TGGGAATCTG
TTCAACTTCA GCGTTTTGCC CGACAGCGGC ATCCTGGTGG CGGTAGGGTC AGATGGCGCA
GTCATACGCT CGACCGATGC CGGCGTGACA TGGACGCCCC GCCCTTCCGG GCAAGATATT
ACCCTAAGAG ATATCCGCGT TCTACCCGAT GGTACGACAC TGATTGCGAT AGGTTTTTTT
GGCGCGATTG TGATTATCGA CGACCGTTAC GCCGATGCCC TCGCCGCGAT CGGACCTCTC
TCCGGCTCCC TCGGCGATAA TGCCTACCGC AGCGGCATTG CCGGGCTCCC GGAGTATGTT
CGCAACCACC CCGTCGTTGG GGCCTTACTC GCCGAACTGG ACGGCACCAT CGAAGGTCGT
GCCGACCTCG AAACCCGCCT TCAATCCGCC CGCGCCTCCG CCGACGAAAT CAGGACAGGC
GGCTTTTCTC TCGCTCAACG GCGCCAGGAT TTCGAGGAGT TCATGCGCGT ATGTACAGCC
GACCTGTCAG ACGAGGCAGA GGGCGTGGGC ACCGAACACT GCACCCGCGC CTATGTCGAC
CTTCGCCAAG CCGAAAGCCA GACGGTGTGG GAGATCCTTG CCGAACGGGC GCCGCAGGCC
ATCTTGCTGT TGTTCCTCCT CGCAACGCTT GCGGCACTTT ATCGGTACAA CATGCGCCTT
GCGGGTTTCC ACGCCGCGCG AGCCGACGCG CTGCATCTCT ACGCCATGGG CCGCACCCAC
GACCCCGCCA TTCTGACGGA GTTCTCGGAC GCACTAGCCG CCGATAAGGT CGAGTTCGGC
AAGGGGAACA CACCGTCGGA GCAGGCTGTC GAGATCGCAA AGGCTATGGT CGGGCGGCGT
GGGTAA
 
Protein sequence
MNQPADIHPK PEPATSPAPP VSPPADIPFP FATLVENQRS RAHNATRRAQ GIFAALIAVI 
LLGIGYYVGL PLYLQWNESS RSVYEAEAGT LSSQLDRMDE ARDRIWTALS EELKGIGTPR
PSGVDVNLND VRSLPDGTIL VAVGRHGTVI RSTDAGNTWT PRRSGVDIDL FNLRVLPDST
TLVAVGEGGT VIRSTDAGNT WQSRPSGVNN SLYDLRVLPN STTLIAVGRS GAVIRSTDAG
NTWMPRPSGV DVDLYNLGIL ADGATLVAVG EDGTVIRSTD AGETWTFLPS VVDADLFNLG
ILPDSTTLVG VGESGTVIRS TDTDSTWMPR PSGVIADLYV PRVLPDGVTL IAVGSSGGII
RSTDAGMTWT PRPSGIDGNL FNFSVLPDSG ILVAVGSDGA VIRSTDAGVT WTPRPSGQDI
TLRDIRVLPD GTTLIAIGFF GAIVIIDDRY ADALAAIGPL SGSLGDNAYR SGIAGLPEYV
RNHPVVGALL AELDGTIEGR ADLETRLQSA RASADEIRTG GFSLAQRRQD FEEFMRVCTA
DLSDEAEGVG TEHCTRAYVD LRQAESQTVW EILAERAPQA ILLLFLLATL AALYRYNMRL
AGFHAARADA LHLYAMGRTH DPAILTEFSD ALAADKVEFG KGNTPSEQAV EIAKAMVGRR
G
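
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that, then compute the GC fraction and translate the coding sequence. It is an illustration only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments ('*' marks a stop codon). Table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGAACCAGC CCGCCGATAT ACATCCCAAG CCAGAACCCG CCACCTCGCC CGCGCCCCCG"
print(translate(first_line))  # MNQPADIHPKPEPATSPAPP
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and protein sequence fields to be checked in the same way.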