Gene Slin_2843 details


Gene Information

Locus tag: Slin_2843
Symbol:
ID: 8726593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3438076
End bp: 3439200
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_003387656
Protein GI: 284037726
COG category:
COG ID:
TIGRFAM ID:
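
The coordinate and length fields in this record are arithmetically related, and the short sketch below checks them against each other. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3438076
END_BP = 3439200


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Number of residues, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1125, matching "Gene Length"
print(protein_length(length))  # 374, matching "Protein Length"
```

Under these assumptions both computed values agree with the reported 1125 bp and 374 aa.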


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.225724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.624237
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCTCT ATCTTCACAT ACCGTTCTGC AAACAGGCCT GCCATTATTG CGACTTTCAT 
TTCAGTACAA GTCTGGGGCA GAAGTCGGCC CTGGTTGATG CGCTTTGTAC GGAAATTGCC
CTGCAAAAAA GCTACCTGCC CGATCAGGCA CTGGAAACGA TTTATTTTGG CGGAGGCACA
CCTTCCCTGT TGACAGAGGC TGAGTTAGCA CAGGTTTTTA CCGCTATTCA CGCTCATTTT
TCAGTTTCAC CAACCGCCGA AATCACCCTT GAAGCCAACC CCGATGATCT TGATTCTGGG
AAGCTTCAGA TGTTGCGTCG CTATGTTAAC CGGCTCAGTA TTGGGATTCA AACGTTCGAT
GAAACTTCGC TCCGCTGGAT GAATCGTGCG CATACTGCTA CGGAAGCCGA AGTATGCGTT
GGGCTGGCGC GGCAAGCGGG ATTTGAGAAT ATGAGCGTTG ACCTGATTTA CGGAATTCCG
AATAGAGACA ATCAGGCGTG GCAGCTTGAT TTGCAAAAAA TACTGGCACT CAACGTGCCC
CACCTCTCGG CTTATGCCCT GACGATTGAA CCCGACACCG CTTTTGGGCG CTGGCAGAAA
AAAGGGAAAT TGCCTCCTGC TGATGAGGAC ACTGCCGCCG GTCAATTTGA GGAGTTAACA
AGTGCGCTGA CAACGGCAGG CTACGCCCAC TACGAAATAT CGAACTTTGC GCGGGATGGC
CAGTACGCCC GGCACAACAC GGCTTATTGG CAACGGCGGC CGTACCTGGG CATCGGGCCA
AGTGCTCATT CGTACAATGG CCACTCGCGA CAATATAACC TGGCCAATAA TGTCCGTTAC
ATCGCAGCGA TTGCCCAGGG AAAACTTCCC GCCACGCTGG AGGAACTGAC CGTTGCCGAT
CAGGTCAATG AATACCTGCT GACTGGTTTG AGGACCCAGT GGGGCTGCTC CCTGACGGAA
CTTGATACGA TGCTGGCGGG CGATTTTTCG ACGATGCAGG CCCGCGATCT GGCCGCTATG
TACAAAACCG GCTGGCTGGT TCGGCATGGC GATACACTTC TGCTGACTCA GCCCGGTAAA
CTCTTTGCTG ATCGGGTGGC CGCTTCCTTA TTCGTAGACG CGTAA
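
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one possible way to do that and to compute a GC percentage for comparison with the "GC content" field. The helper names are hypothetical, and the result for the full sequence has not been verified here.

```python
def clean_sequence(text):
    """Remove all whitespace and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCATCTCT ATCTTCACAT ACCGTTCTGC AAACAGGCCT GCCATTATTG CGACTTTCAT"
print(len(clean_sequence(first_line)))  # 60 bases per full line
print(round(gc_percent(first_line), 1))
```

Passing the whole sequence, all lines concatenated, to `gc_percent` gives a figure that can be compared with the reported value.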
 
Protein sequence
MHLYLHIPFC KQACHYCDFH FSTSLGQKSA LVDALCTEIA LQKSYLPDQA LETIYFGGGT 
PSLLTEAELA QVFTAIHAHF SVSPTAEITL EANPDDLDSG KLQMLRRYVN RLSIGIQTFD
ETSLRWMNRA HTATEAEVCV GLARQAGFEN MSVDLIYGIP NRDNQAWQLD LQKILALNVP
HLSAYALTIE PDTAFGRWQK KGKLPPADED TAAGQFEELT SALTTAGYAH YEISNFARDG
QYARHNTAYW QRRPYLGIGP SAHSYNGHSR QYNLANNVRY IAAIAQGKLP ATLEELTVAD
QVNEYLLTGL RTQWGCSLTE LDTMLAGDFS TMQARDLAAM YKTGWLVRHG DTLLLTQPGK
LFADRVAASL FVDA
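
The protein sequence corresponds to the gene sequence translated with table 11. A minimal sketch of that translation follows, using the standard codon-to-amino-acid assignments, which table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits; that is not an issue here because this gene starts with ATG. All names in the block are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGCATCTCT ATCTTCACAT ACCGTTCTGC"))  # MHLYLHIPFC
print(translate("TTCGTAGACG CGTAA"))                  # FVDA
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.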