Gene Jann_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2804 
Symbol 
ID3935270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2816562 
End bp2817554 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content65% 
IMG OID637905171 
Productallophanate hydrolase subunit 2 
Protein accessionYP_510746 
Protein GI89055295 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.776372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGG CCCGGTTCAA GGTGCTGTTC GCTGGTCCGT TGGTGTCGTT CCAGGATGGC 
GGCCGCTTCG GCCACATGCG CTTTGGCGTG CCCGCGTCCG GCCCGATGGA TCGGTTCGGC
TTCGCCGCCG TCCATGCGAT GTTGAACCAG CCCGAGGCCG CGGCAATTGA GATTTCCCTT
GGCGGTTTGG TTCTGGAATG TGTCGACGGC TCCGTGACCT GCGCCGTGGC GGGTGGGGCT
TTCAGCCTCA CAACGCAGGC GTCTGGATGG CAGGTCGCGA CCGTCCACGC CGGAGATAAA
CTGACCCTCC GCGCAGGCGA TTGGGGCAGT TGGGCCTACC TCGCATTCTC TGGGGAAATC
TCATGCGATC AATGGCTTGG CTGTTCGGCC ACCCACGCGC TCTCGGGCCT TGGCGGTGGT
AGCCTGCGCA CCGGCGACGT GTTTGAGGTT CGCGATTGTG CCCCGCGCCC GACCCGCGAA
GGGGCCTATG ACGCCCCCGA CATCGCGCGG CCCGTGGCGG ACATTCGCGT GATCATCGGC
CCCCAGGATC AGCATTTCGC CCCCGATGCG CAGGACATTT TGACCGCTGC ACCTTACACA
CTGACCGATG CGTTTGACCG GATGGGCGTG CGGCTCGACG GCACTGTGCT TCCTTTGGGC
GATGCGCTCT CAATCCCGTC TGAGCCGATC TTGCGCGGGT CTATTCAGGT GGCGGGCGAT
GGCGTGCCCG TGGTCCTCCT GGCCGATCAC CAAACCACTG GCGGCTATCC CAAGATTGCG
ACCGTGCTGT CCACCGACAC GGACCGCCTG GCGCAACTCC GTGCGGGCGA CAGCTTGCGG
TTTCACGCGA TTTCCGCCGC GGACGCCGTT CTCGCTGTCC GCCATGATCA CGCAGCCCGC
ATCGAGGCCC TGTCAGATCT CGCCGCACCC CGAGCCTCGC TCAGCCAAAA GTTGATGCAA
ACCAATCTGA TCAGCGGCGT CACCGGAGAT TGA
 
Protein sequence
MSVARFKVLF AGPLVSFQDG GRFGHMRFGV PASGPMDRFG FAAVHAMLNQ PEAAAIEISL 
GGLVLECVDG SVTCAVAGGA FSLTTQASGW QVATVHAGDK LTLRAGDWGS WAYLAFSGEI
SCDQWLGCSA THALSGLGGG SLRTGDVFEV RDCAPRPTRE GAYDAPDIAR PVADIRVIIG
PQDQHFAPDA QDILTAAPYT LTDAFDRMGV RLDGTVLPLG DALSIPSEPI LRGSIQVAGD
GVPVVLLADH QTTGGYPKIA TVLSTDTDRL AQLRAGDSLR FHAISAADAV LAVRHDHAAR
IEALSDLAAP RASLSQKLMQ TNLISGVTGD