Gene Slin_4254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4254 
Symbol 
ID8728013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5134332 
End bp5136194 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content51% 
IMG OID 
ProductCarbamoyltransferase 
Protein accessionYP_003389037 
Protein GI284039107 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0843761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAT TAGGCATCTC TGCTTTTTAT CACGACTCGG CAGCAGCATT AATTGAAGAC 
GGAAAAATTA TTGCCGCTGC TCAGGAAGAA CGATTTACCC GAAAAAAGCA CGATCCAGGG
TTTCCGGCTG AAGCAATAAA ATATTGTCTT CAGTACAGTG GAACAGACCT CAATAAACTC
GACGCTATTG TATTCTACGA TAAACCGCTA CTCAAGTTTG AGCGGTTACT GGAAACGTAT
TATGCCTTTG CGCCCCAGGG GCTTCGGTCG TTCTTAATGT CGATTCCAGT GTGGCTCAAA
GAAAAGCTGT TTTTGAAACG GCTTATTCGG GAGGAACTGG AAAAACTAGG CTACAAGTCG
GCCAGTAAGG TTAAACTGCT TTTTCCTGAA CACCATCTTT CGCACGGAGC CAGTGCGTTT
TACCCCTCCC CGTTCGAACG GGCTGCTATT CTGACCATTG ATGGCGTGGG CGAATGGGCC
ACCGCTTCGA TTGGCCTGGG TGAAGGGAAG AATATTTCTA TTCTGAAAGA AATGCGATTT
CCGCACTCGC TCGGCTTACT GTATTCGGCT TTTACCTATT TTCTGGGTTT TCGGGTCAAT
TCGGGGGAGT ATAAATTAAT GGGGCTTGCT CCCTATGGCG ACCCCAACTC GCCCGATGTT
GCCCGCTACA TGGCCACCAT CAAAGAAACG CTGGCCGACC TTCGGCCTGA TGGGTCGATC
TGGCTCAACC AGGATTATTT CGACTATGCC ACGGGCCTGA AGATGGTGAA CGAAGGCAAA
TGGGCTGAAT TATTCGGGTT TCCCAAGCGC CAGCCGGAAG ATGAACTGCT GCCTCAACAT
TGTAATCTGG GCCTGGCTAT CCAGTACCTG ACCGAAGAAG TGGTTCTGAA CATGGCCAAA
GAAGCCAAAC GGCTTACCAA TGCCGATGCG CTGGTATTGG CGGGTGGTGT GGCCCTGAAC
TGCGTATCGA ACGGAAAGCT TCAGGCAGCG GGTCTTTTCA AGGACATATT CATTCAGCCA
GCCGCTGGTG ATGCCGGTGG TGCCCTAGGT GCAGCCCTGG CCGCTTACCA CATTTACTTC
GGCAAAGAGC GGGTCGTAAC GACCGAGCGC GATGCCATGC GGGGTTCTTA TCTGGGGCCA
ACCTTTTCGG ATCTGGATGT GGAGCTGATG GCAAAGAAAT ACAAAGCCGT AGCTACTCAT
TACACCGATT TTACCGAACT GAGCCGCGAT GCCGCCAAAC TTCTGGCCGA AGGCAACGTA
CTGGGCTGGG TGCAGGGACA GATGGAGTTC GGTCCGCGTG CATTGGGTGG CCGCAGCATT
CTGGGTGACC CCCGCAATGC CGAAATGCAG AAAAAGCTGA ACCTGAAAAT CAAATACCGC
GAATCGTTCC GGCCGTTTGC GCCGTCGGTA CTGGCCGAAG ATTGTGCCGA ATATTTCGAT
TATGACGGTA TTTCGCCCTA TATGCTGCTC GTTCATCCGG TAGCGCAAAA ACGGCGCACA
CCCGTCCCGG CCGATTACGC CAGCTTTCCA CTCCGCGAGA AGCTGTATTA CCAGCGCTCC
GATCTACCTT CTATTACGCA CATCGACTAC TCGGCCCGGA TTCAGACAGT TCATAAAGAT
ACCAACCCGC GCTATTACCA GTTAATAAAT GCCTTTAAAC AGCTGACCGG TTATGGTGTT
ATCGTTAATA CCAGCTTCAA CGTGCGGGGT GAGCCTATTG TATGTACCCC GGATGATGCG
TACCGGTGTT TCATGCGTAC CGAAATGGAC TATCTGGCCG TTGGCAATTA CCTGTTCGAC
AAACGGCAGC AGCCTGAGTG GCAGGAGAAG GATAACTGGA AGGAAGAATT TGTTCTAGAT
TAA
 
Protein sequence
MTILGISAFY HDSAAALIED GKIIAAAQEE RFTRKKHDPG FPAEAIKYCL QYSGTDLNKL 
DAIVFYDKPL LKFERLLETY YAFAPQGLRS FLMSIPVWLK EKLFLKRLIR EELEKLGYKS
ASKVKLLFPE HHLSHGASAF YPSPFERAAI LTIDGVGEWA TASIGLGEGK NISILKEMRF
PHSLGLLYSA FTYFLGFRVN SGEYKLMGLA PYGDPNSPDV ARYMATIKET LADLRPDGSI
WLNQDYFDYA TGLKMVNEGK WAELFGFPKR QPEDELLPQH CNLGLAIQYL TEEVVLNMAK
EAKRLTNADA LVLAGGVALN CVSNGKLQAA GLFKDIFIQP AAGDAGGALG AALAAYHIYF
GKERVVTTER DAMRGSYLGP TFSDLDVELM AKKYKAVATH YTDFTELSRD AAKLLAEGNV
LGWVQGQMEF GPRALGGRSI LGDPRNAEMQ KKLNLKIKYR ESFRPFAPSV LAEDCAEYFD
YDGISPYMLL VHPVAQKRRT PVPADYASFP LREKLYYQRS DLPSITHIDY SARIQTVHKD
TNPRYYQLIN AFKQLTGYGV IVNTSFNVRG EPIVCTPDDA YRCFMRTEMD YLAVGNYLFD
KRQQPEWQEK DNWKEEFVLD