Gene Jann_3447 details


Gene Information

Locus tag: Jann_3447
Symbol:
ID: 3935921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3492199
End bp: 3495129
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 66%
IMG OID: 637905821
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_511389
Protein GI: 89055938
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
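
The coordinates and lengths in this record can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3492199
end_bp = 3495129
gene_length_bp = 2931
protein_length_aa = 976

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

The check does not depend on the strand, which is not given in the record.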


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.54882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTTG ATCACAAGGG CATCATCGAC CGCTCCGCCC CCGTCCGTTT CCATTTCAAC 
GGCGCGCCCT ACAGCGGCTT CAAGGGGGAC ACCGTCGCCT CTGCCCTAAT GGCCAATGGG
GTAAAGCTGG TGGGTCGGTC CTTCAAGTAT CATCGCCCGC GCGGGGTCCT GACCGCCGGA
TCGGAGGAGC CGAACGCCCT GATACAGGTC GGCGAAGGCG ACGCGATGCT GCCCAATGTC
CGCGCCACGG TGCAGGAGGT TTTCGCGGGG CTGGAGACGG CGTCACAAAA CCACCTCGGG
CCGCTGGCCT ATGACCTGCT CAGCGTCAAC GATTTGTTCC CCAACTTCCT GAGCGCGGGC
TTTTATTACA AGACATTCAT GTGGCCCCGC TCCTTCTGGG AACGTGTCTA CGAGCCCGCA
ATCCGCCGCG CCGCCGGTCT GGGGCGGATG ACGGAAGGGG CATCGCCCGA CATCTCGGAG
CGCGCCTTCG CCTTCTGCGA CGTGCTGGTG ATCGGCGGCG GCCCCACCGG CCTCATGGCC
GCGCTCACCG CTGCAATGTC TGGCGCAGAC GTCATCCTGG TCGAGGAAAC CGCCGATCTG
GGCGGGCGAG TGTTATCCGA CGGAGAGGTC ATCGACGGTC TGCCGGCGGA CGTCTGGGTG
GGCGAAACCG TGCAAAAACT GCACGCCACG GGCCGCGTGC GGATCATGAC CCGCACCACG
GCGACGGGCG TCTATGACGG TCTGACCTTT GGGGCCGTGG AACGCGTGGG CAGCCATCTG
CCCCGCGCCG ATCATCTGCC GCAGGAATGC TTCTGGCGTA TCCGGGCCGG TCAGGCGGTG
TTGGCCGCGG GCGCGCTGGA ACGGCCCATC GCCTTCCCCA ACAACGACCG ACCCGGCATC
ATGATGGCCT CGGCCATCCG CACCTATATC AACCGCTATG GCGTGACACC CGGCGAGAAG
GTCGCGATCT TCGCCAGCAA CGACGACGCC CACAAAACCG CGCTTGACCT GCAGGACGCG
GGCGTTGACG TTGCCGCCGT CATCGACAGC CGCGAAGATG CACAAGCCAT GGGCGACTAT
CGTCTGTATA CCGGCGCGCA GGTCGTGGGC AGCAAGGGAC GTCAGGCCCT GCGCGAGATC
ACGGTTCAGC GCGGCAGTTC CACCTTCAAG GTGGAGGCCG ATTGTCTTGG TATCTCAGGC
GGTTGGAACC CTACCCTGCA CCTCACCTGC CATTTGGGCG CACGGCCCGT ATGGGATTCG
GGCATCCATG CTTTCGTGCC CAAAGAGGCC GCGATAGCGG GCTTGCGCCC CGCAGGGGCC
TGCGCCGGGA CATTTTCATC CGAAGGATGC ATCAAAGATG GCATCGCCGC CGCGGCAGCG
GCGCTGAAAG CGCTCGGGTT GCGCGTGCGA AAGCCCGATC TGCCGCAATC CGAGGGCGCC
GCAGGCGCCA CCGCCCCCCT TTGGAGCGTC AGCGCAAAAG GCCGCGCCTG GCTCGACTTC
GCCAATGACG TCACCACCAA GGACGTGAAA CAATCCGCCG CCGAGGGCTT CAAATCCGTC
GAACACATGA AGCGCTACAC CACGCAGGGC ATGGCACCCG ATCAGGGCAA ATCCTCCAAC
ATAGGCGCGC TTGCCGTCCT TGCTGACGCG ACCGGCAAGG GCATTCCAGA GACCGGCACC
ACCACCTATC GCCCGCCCTT CACGCCCGTG GCGCTCGGCA CCCTGGCCGC CGGGGCGCAG
GGCAAGGGCT TCGCGCCGGA ACGCTTCACC ACCTCGGACA AGGCCGCCCG CCAGCGTGGC
GCGCCGATGA TCGCCGTGGG CCTCTGGTAT CGCGCCTCCT ACATGCCCCA GCCCGGCGAA
ACCCATTGGC GGCAATCCTG CGACCGGGAG GTCAACATGG TCCGCAATGC CGTGGGCGTG
GTCGACGTCT CCACCCTCGG CAAGATCGAG ATCTTCGGCG CGGATGCGGG CGCATTTCTC
GATTTCTTAT ATACGAACAC CTTCTCCACC CTCAAACCGG GCCGCGCGCG CTATGGTCTG
ATGCTGCGCG AAGACGGGCA TGTCATGGAT GACGGCACCA CCGCCTGCCT TGCCGACAAC
CACTACGTGA TGACCACCAC CACCGCCGCC GCCGGTCCGG TGATGGCGCA TATGGACTTC
GCGAGCCAGG TCCTGCGCCC GGATCTGGAT GTCGCCTTCA CGTCCGTGAC GGAACAATGG
GCGCAGTTCT CCGTCGCCGG TCCCCACGCC CGCACCCTGA TCAACGGCGT GCTCGACCAG
CCCATCGACG GTGACAGCTT TCCGTTCATG CAATGCGGCG CGGTCCGCGT TCACGGCGTC
CCCGGTCGCC TCTTCCGCAT CTCCTTCTCG GGCGAACATG CCTACGAGGT CGCCGTGCCC
GCGGCCTATG GCGACGCGCT CTACCGCGAC CTTGTGGCCC GCGCCGAGGC GTTGGGCGGC
GGCGCCTACG GGATGGAGGC GCTCAACGTG CTGCGGATCG AGAAGGGGTT CATCACCCAT
TCGGAGATCC ATGGCCGCGT CACCGCGTTC GATGTCGGCA TGCAGGGGAT GATGTCGAAA
AAGAAAGACT TCATCGGCAA GGCGGCGGCG ACACGTCCGG GCCTGTTGGA GGCGGATCGC
GAACGGCTCA TCGGCCTCAA ACCCACCGGG GCGGTGAAGG AGTTGACCGC AGGTGCGCAT
CTGTTCAACA CCGATGACGC TCCAACGCGG GAAAACGACC AGGGCTACGT CACCTCCGTC
GGCTATTCGC CCACCCTTGG CCATTTCGTC GGCCTTGGCT TCCTGCGTGA CGGGCCGAAC
CGCGTCGGGC ATCATATGAT GATGGTCGAC CACCTGCGGG GCGTCACGGC GGCGGTCGAA
ATCTGTGATC CGGTCTTCTT TGACCCCGAC GGAGGGCGCG CCCGTGGCTG A
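
The gene sequence is listed in blocks of 10 bases separated by spaces, so it needs to be joined before any analysis. The following is a minimal Python sketch of that clean-up together with a GC-content calculation. The function names `clean_sequence` and `gc_content` are hypothetical helpers introduced for illustration, and only the first line of the listing is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied as shown.
first_line = "ATGAGGCTTG ATCACAAGGG CATCATCGAC CGCTCCGCCC CCGTCCGTTT CCATTTCAAC"
fragment = clean_sequence(first_line)

print(len(fragment))
print(round(gc_content(fragment), 1))
```

Running `gc_content` on the complete cleaned sequence, rather than on this fragment, is what would be comparable to the GC content value reported in the Gene Information section.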
 
Protein sequence
MRLDHKGIID RSAPVRFHFN GAPYSGFKGD TVASALMANG VKLVGRSFKY HRPRGVLTAG 
SEEPNALIQV GEGDAMLPNV RATVQEVFAG LETASQNHLG PLAYDLLSVN DLFPNFLSAG
FYYKTFMWPR SFWERVYEPA IRRAAGLGRM TEGASPDISE RAFAFCDVLV IGGGPTGLMA
ALTAAMSGAD VILVEETADL GGRVLSDGEV IDGLPADVWV GETVQKLHAT GRVRIMTRTT
ATGVYDGLTF GAVERVGSHL PRADHLPQEC FWRIRAGQAV LAAGALERPI AFPNNDRPGI
MMASAIRTYI NRYGVTPGEK VAIFASNDDA HKTALDLQDA GVDVAAVIDS REDAQAMGDY
RLYTGAQVVG SKGRQALREI TVQRGSSTFK VEADCLGISG GWNPTLHLTC HLGARPVWDS
GIHAFVPKEA AIAGLRPAGA CAGTFSSEGC IKDGIAAAAA ALKALGLRVR KPDLPQSEGA
AGATAPLWSV SAKGRAWLDF ANDVTTKDVK QSAAEGFKSV EHMKRYTTQG MAPDQGKSSN
IGALAVLADA TGKGIPETGT TTYRPPFTPV ALGTLAAGAQ GKGFAPERFT TSDKAARQRG
APMIAVGLWY RASYMPQPGE THWRQSCDRE VNMVRNAVGV VDVSTLGKIE IFGADAGAFL
DFLYTNTFST LKPGRARYGL MLREDGHVMD DGTTACLADN HYVMTTTTAA AGPVMAHMDF
ASQVLRPDLD VAFTSVTEQW AQFSVAGPHA RTLINGVLDQ PIDGDSFPFM QCGAVRVHGV
PGRLFRISFS GEHAYEVAVP AAYGDALYRD LVARAEALGG GAYGMEALNV LRIEKGFITH
SEIHGRVTAF DVGMQGMMSK KKDFIGKAAA TRPGLLEADR ERLIGLKPTG AVKELTAGAH
LFNTDDAPTR ENDQGYVTSV GYSPTLGHFV GLGFLRDGPN RVGHHMMMVD HLRGVTAAVE
ICDPVFFDPD GGRARG
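
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation; because this gene starts with ATG, alternative start codons are not handled. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI genetic code layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate("ATGAGGCTTG ATCACAAGGG CATCATCGAC"))  # prints MRLDHKGIID
```

The output matches the first ten residues of the protein sequence listed above.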