Gene Cfla_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2050 
Symbol 
ID9145946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2288564 
End bp2290309 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content70% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003637144 
Protein GI296129894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.382467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.714233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATGC GACTGCTCCG GGAGCACCTG CGCCCGTACC AGGGCGCCGT CGTCGCCGTG 
CTCGCCCTGC AGATGGTGCA GGTCATCGCC ACGCTGTGGC TGCCCAGCCT CAACGCCGAC
ATCATCGACG ACGGCGTCGC CCAGGGGGAC ACCGCCACGA TCTGGCGTCT CGGCGGCGTC
ATGCTCGCCG TCAGTCTCGT GCAGGTCGTC GCGTCGGTGG CCGCGGTGTG GTTCGGTGCG
CGCACCGCCA TGGCGTTCGG CCGCGACGTG CGGGCACGGC TGTTCGACCA GGTCCAGTCC
TTCTCCCAGC AGGAGATGGG GAGGTTCGGA GCACCGACGC TCATCACGCG CACCACGAAC
GACGTCCAGC AGGTGCAGAT GGTCGTCTTC ATGACCTTCG TGTTCCTGGT CATGGCTCCG
CTCATGCTGG TCGGCGGCGT GGTGATGTCG CTGCGCGAGG ACGTGGGGCT CTCCGCACTG
CTGCTCGTCG TGGTGCCGGT GCTCGCCGTG GTCATCGGGC TGATCGTGTG GCGGATGGTG
CCGTGGTTCC GGCAGATGCA GCAGCGGATC GACGCGGTGA ACCGCGTGAT GCGCGAGCAG
CTCACGGGTG TCCGTGTCAT CCGCGCGTTC GTGCGCGAGC GGCAGGAGCA GGAGCGCTTC
GAGGTCGCCA ACACGGCCCT GTACGTCGCG TCGCTGCGCG CCGGGCTGCT GTTCGCCCTG
ATGTTCCCCG TGGTGATGCT CGTGATGAAC GTCTCGAGCG TGTCGGTCCT GTGGTTCGGG
GCACAGCGGG TCGACGACGG GCTCATGCAG ATCGGGTCGC TCATCGCGTT CCTCAGCTAC
ATCATGTTCG TCCTCATGGC CGTGATGATG AGCTCGATGA TGGTCGTCAT GGTCCCGCGG
GCCATGGTGT CCGCCGACCG CATCGGCGAG GTGCTCGACA CCTCCACGAC CGTGGTGCCG
CCCACCCGGC CGGTCGCGTT CGCCGAGGGC CCGGACGCGT CGCGCGGGCG GCTCGAGCTG
CGGGACGTCG AGTTCCGGTA CCCGGGCGCC GAGCACCCGG TGCTGCACGA CGTGTCGTTC
GTGGCCGAAC CCGGCCGGAC CACGGCCGTC ATCGGGTCGA CGGGCTCCGG CAAGACGACG
CTCCTGCACC TCGTCCCACG CCTGTACGAC GTGACGGGCG GCAGCGTGCT CGTCGACGGC
GTCGACGTCC GCGAGGCCGA CCCGGTGGCG CTGGGCTCAC GCATCGGGCT GGTGCCGCAG
CGCCCGTACC TGTTCTCCGG CACCGTGCGC AGCAACCTGC AGTTCGGGCG GCCCGAGGCC
ACCGACGACG AGCTCTGGCA CGCGCTGGAG GTCGCGCAGG CGCGCGACTT CGTCGAGGCG
CTGCCCGAGG GCATCGACGC ACCGGTCGCG CAGGGCGGGA CGAACCTGTC GGGCGGGCAG
CGCCAACGGC TGGCCATCGC GCGGGCGCTC GTGCGGCGCC CGAGCATCTA CCTGTTCGAC
GACTCGTTCT CCGCGCTGGA CTACGCCACC GACGCCGCAC TGCGCGCGGC CCTGGCACCG
GAGACCCGGC GGGCGACGGT GCTCGTCGTG GCCCAGCGCG TCGCGACCAT CCGCCACGCC
GACCGCATCC TCGTCCTCGA CGAGGGACGC GTGGTCGGTG ACGGCACCCA CGACGAGCTG
CTGGCGTCGA ACGAGACGTA CCAGGAGATC GTGTACTCCC AGCTGAGCGC GCAGGAGGCG
GCATGA
 
Protein sequence
MLMRLLREHL RPYQGAVVAV LALQMVQVIA TLWLPSLNAD IIDDGVAQGD TATIWRLGGV 
MLAVSLVQVV ASVAAVWFGA RTAMAFGRDV RARLFDQVQS FSQQEMGRFG APTLITRTTN
DVQQVQMVVF MTFVFLVMAP LMLVGGVVMS LREDVGLSAL LLVVVPVLAV VIGLIVWRMV
PWFRQMQQRI DAVNRVMREQ LTGVRVIRAF VRERQEQERF EVANTALYVA SLRAGLLFAL
MFPVVMLVMN VSSVSVLWFG AQRVDDGLMQ IGSLIAFLSY IMFVLMAVMM SSMMVVMVPR
AMVSADRIGE VLDTSTTVVP PTRPVAFAEG PDASRGRLEL RDVEFRYPGA EHPVLHDVSF
VAEPGRTTAV IGSTGSGKTT LLHLVPRLYD VTGGSVLVDG VDVREADPVA LGSRIGLVPQ
RPYLFSGTVR SNLQFGRPEA TDDELWHALE VAQARDFVEA LPEGIDAPVA QGGTNLSGGQ
RQRLAIARAL VRRPSIYLFD DSFSALDYAT DAALRAALAP ETRRATVLVV AQRVATIRHA
DRILVLDEGR VVGDGTHDEL LASNETYQEI VYSQLSAQEA A