Gene Jann_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2093 
Symbol 
ID3934546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2101395 
End bp2102576 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content65% 
IMG OID637904449 
ProductVWA containing CoxE-like 
Protein accessionYP_510035 
Protein GI89054584 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.602608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGG TGACGAAATT CGCCGGGCGC GATCCCGGTC CTGCGGCGCG GGTGGCGGGT 
TTCATCGCGC ATCTGCGTGA AAACGGGCTG CGTCTGGGCG TGGCGGAGGC TGATCTGGCC
ATGGCCGCCC TCACCCATGT AAACGCCATA CAGCCCGACG ACAGCCGCCG CGCGCTACGC
GCCGTGTGCA CCGGCTGTAA AGAGGAGGCG GAGCGGTTCG ACGACCTCTT TGACAGCTAT
TGGATGGACA TGGGCCGGGT TAAATCCAAG GTGATCCCGA CGCCATCCAG CACCATCAGC
GACGATGTCC ATTCATCTCG TGATGCCAAA GGTGAGGACG CGAGCGCCTC CGGCTCGGCC
ACGGCCCCCG ACGACCAGGA TGGTGCGGCA GACAGTGATG GTACAGGCAA GCTGATCGCG
ACCGAGCAAC GCAACCTGAG CCGGCGTGAC TTACGTGATC TGGTGCGGCC AGAAGAAATC
GCCGAGGCCG AGGATGTCGC CCGCCGCCTG GGAGCGGCCC TCCGCGATGC CCGCTCCCGT
CGCCGCATCG CTGCCCGTAA GGGCGACAGG CTGCACTTCC GCAAGACCAT CCGCCACAGC
CTTGCCACCG GGGGCGAGCC CCTGCGCCTG CTACGCAAGA AGCGCCCGGA CCGGACCCGC
AACATCGTGG CGCTGTGCGA TGTCTCAGGC TCCATGTCCG TCTACGCCAA GGTGTTTCTC
GCCTTCCTCG CCGGACTGAT GCGGGCCGAC ACCGCCGCCG ACGCCTACCT TTTCCACACC
CGCCTTGTGC GGATCACCGA GGCGTTGCGC GATAAAGATG CGATGCGCGC GATTGGGCGG
ATGTCCCTGA TGGCCGACGG CTTTGGCGGC GGCTCCAAGA TCGGCCCGTC GCTGATGCGC
TTTGCCGACA CCTATGCAAA ACGCTTCGTC AATGGCCGCA GCGTCGTGCT GATCCTGTCG
GACGGCTATG ACACGCAAGC GCCCGATATG ATTGCCGGGG CGCTGGCCAA GCTGCGCAAG
CGGGGCTGCA AGGTGATCTG GCTCAATCCT CTGAAAGGCT GGTCAGATTA CGCACCGGTG
GCCGAGGGCA TGGCCGCTGC CCTCCCCTAT CTGGATGCCT TCAAGGCGGC CAATACGCTG
GCTGACCTTG CGGCCTTGGA ACAGGAGCTG GCGCGCGTAT GA
 
Protein sequence
MSRVTKFAGR DPGPAARVAG FIAHLRENGL RLGVAEADLA MAALTHVNAI QPDDSRRALR 
AVCTGCKEEA ERFDDLFDSY WMDMGRVKSK VIPTPSSTIS DDVHSSRDAK GEDASASGSA
TAPDDQDGAA DSDGTGKLIA TEQRNLSRRD LRDLVRPEEI AEAEDVARRL GAALRDARSR
RRIAARKGDR LHFRKTIRHS LATGGEPLRL LRKKRPDRTR NIVALCDVSG SMSVYAKVFL
AFLAGLMRAD TAADAYLFHT RLVRITEALR DKDAMRAIGR MSLMADGFGG GSKIGPSLMR
FADTYAKRFV NGRSVVLILS DGYDTQAPDM IAGALAKLRK RGCKVIWLNP LKGWSDYAPV
AEGMAAALPY LDAFKAANTL ADLAALEQEL ARV