Gene Cfla_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1967 
Symbol 
ID9145861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2188983 
End bp2190641 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003637061 
Protein GI296129811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.732438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTT ACGACGTCGT CCTCGCAGCG ACCGCCGACG GCGGGCAGGT CGGGGACCCC 
GTGGTCAACA TCGCCATCTT CGGTGCGTTC GTCCTGGTGA CGCTCGTCAT CGTGTTCCGC
GCGTCGCGCA ACAACAAGAC GGCCGCCGAC TACTACGCGG CCGGTCGCTC CTTCACCGGT
CCGCAGAACG GCACCGCGAT CGCGGGCGAC TACCTGTCCG CCGCGTCGTT CCTCGGGATC
TGCGGCGCTA TCGCGATCTA CGGCTACGAC GGGTTCCTCT ACTCCATCGG GTTCCTCGTC
GCGTGGCTCG TGGCGCTCCT GCTCGTGGCG GAGCTCCTGC GCAACACCGG GCGCTTCACG
ATGGCCGACG TGCTGTCGTT CCGGCTCCGC CAGCGCCCGG TGCGCCTCGC GGCAGCGATC
TCGACGCTCG CGGTCGTGTT CTTCTACCTG CTGGCGCAGA TGGCCGGCGC GGGCGGCCTC
GTCGCGCTGC TGCTCGGCAT CGACGGTGTC GCGGGCCAGG GTCTGGTCAT CGCCGTCGTG
GGCGCGCTGA TGATCCTCTA CGTCCTGGTG GGCGGCATGA AGGGCACCAC CTGGGTGCAG
ATCATCAAGG CGATCCTGCT CATCGCGGGC GCCGGGATCA TGACGATCTG GGTGCTCGCG
AGGTACGGGT TCGACCTGTC GGCGCTCCTG CAGGGCGCGA TCGACGCCGG GGGCGAGGAA
GGCAGCAAGC TCATCGAGCC GGGCAAGCAG TACGGCGCGT CCGCCCTGAC GCAGCTCAAC
TTCCTGTCGC TGGCCCTCGC GCTGGTCCTC GGCACGGCCG GTCTGCCGCA CGTGCTCATG
CGCTTCTACA CGGTGCCGTC CGCCAAGGAG GCCCGGCGCT CGGTCGTCTG GGCGATCTGG
CTCATCGGGA TCTTCTACCT GTTCACGCTC GTGCTGGGCT ACGGCGCCGG AGCGATCGTC
GGCCCGGAGA CGATCATGGC CTCGCCCGGC AAGGCGAACT CGGCGGCACC CCTGCTCGCC
TACGAGCTCG GCGGGGTCTT CCTGCTCGGC ATCATCTCGG CCGTCGCGTT CGCCACGATC
CTCGCGGTCG TCGCCGGCCT GACGATCACC GCCGCGGCGT CGTTCGCGCA CGACATCTAC
GCGTCGGTCA TCAAGAAGGG GCAGGTCGCC CCCGACGGCG AGGTGCGCGT CGCGCGGATC
ACGGTGCTGG TCATCGGTGG TCTGGCCATC GTCGGCGGCA TCTTCGCCAA CGGCCAGAAC
GTCGCGTTCC TCGTGGCCCT CGCCTTCGCG GTCGCCGCGT CGGCCAACCT GCCGACGATC
ATCTACTCGC TGTTCTGGAA GCGGTTCAAC ACCGCCGGCG CGCTGTGGAG CATGTACGGC
GGGCTCATCT CGTGCGTGCT GCTCATCGCC TTCTCGCCGG TCGTGTCCGG CAAGGTCGAC
CCGACCACCG GTGCCAGCCT GTCGATGATC CGCGACACGT CCATCGACTT CGCGATCTTC
CCTCTCGAGA ACCCGGGCAT CATCTCGATC CCGCTCGCGT TCCTGCTCGG GATCGTCGGG
ACGCTGCTGT CCAAGGAGCA GCCGCACCCG GAGAAGTTCG CCGAGATGGA GGTCCGCTCG
CTCACGGGTG CCGGGGCCGA GAAGGCCTCG GTGCACTAG
 
Protein sequence
MSRYDVVLAA TADGGQVGDP VVNIAIFGAF VLVTLVIVFR ASRNNKTAAD YYAAGRSFTG 
PQNGTAIAGD YLSAASFLGI CGAIAIYGYD GFLYSIGFLV AWLVALLLVA ELLRNTGRFT
MADVLSFRLR QRPVRLAAAI STLAVVFFYL LAQMAGAGGL VALLLGIDGV AGQGLVIAVV
GALMILYVLV GGMKGTTWVQ IIKAILLIAG AGIMTIWVLA RYGFDLSALL QGAIDAGGEE
GSKLIEPGKQ YGASALTQLN FLSLALALVL GTAGLPHVLM RFYTVPSAKE ARRSVVWAIW
LIGIFYLFTL VLGYGAGAIV GPETIMASPG KANSAAPLLA YELGGVFLLG IISAVAFATI
LAVVAGLTIT AAASFAHDIY ASVIKKGQVA PDGEVRVARI TVLVIGGLAI VGGIFANGQN
VAFLVALAFA VAASANLPTI IYSLFWKRFN TAGALWSMYG GLISCVLLIA FSPVVSGKVD
PTTGASLSMI RDTSIDFAIF PLENPGIISI PLAFLLGIVG TLLSKEQPHP EKFAEMEVRS
LTGAGAEKAS VH