Gene Cfla_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1004 
Symbol 
ID9144879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1113392 
End bp1114987 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content77% 
IMG OID 
ProductLeucyl aminopeptidase 
Protein accessionYP_003636109 
Protein GI296128859 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCCC GCTCCACCGG CGCGGCGGCC CTGCCGCCGC GCACCCCGCC CGCCGTCACG 
CTGCACGCGG CGAGCGTCGC CGACTCGGAC CTCCTGGACG ACGGGTCGGT CGACGCCGTC
GCCGTGCAGG TGGCCCCGGG CCGCGACGGT GACGATGCGC TGCAGCCCCG CTCGGGCACG
CCCCAGGCCG CCGCGCGGTA CGGCATCGAC CTGGCCGAGC TCGCGGAGCG CGCCGGTCTG
ACGGGTGCTG CCGGTGAGGC GTTCACCGTC CACCTGCCGC TGCCCGTCGG CTCGTCCGTC
GAGCTGCCGT GGGCGGGACT GCCGCCGCGG ATCGTCCTCG TGGGCGTCGG CGACGAGGGC
CCGACGGCGC TGCGTCGGGC AGGTGCGGCG CTCGCGCGTG CGACGCGCGG GCTGACGCGG
GTGGCCGCGA CCGTGGGCGC GCAGACCCAC CACGACGAGG CGGGCGCAGC GCAGGCCGCA
CGCGCCGTCG CCGAGGGCTA CCTGCTGGCC GCCTACCGGC AGCCGCGCAC GACCCGGACC
CCGGACGACG AGCGGCCCGC GGAGCTCGTG CTGCTCGGTC GCGACGGTGC GGCCGTCGCA
GCCGCGGTCG AGACCGCGCG GACGGGCGCC GAGGCGACCT GGCTGGTGCG GGACCTGGCG
AACACGCCGT CCAGCGTGAA GGACCCCGCG TGGATGGCCG ACCGCGCGCG GCGCCTCGGT
TCCAGGGCCG GGCTCGACGT GCAGGTGCTC GGGCCGCGCG AGCTCGCCGC GGGCGGGTTC
GGCGGCATCC TCGCCGTCGG TGCCGGCTCG GCGTCGACCC CGCGCCTCGT GCGCCTGACG
TACACGCCCG CGAAGGGCGG CGGGCGGCAC GTGGTGGTCG TCGGCAAGGG CATCACGTAC
GACACGGGCG GGCTGTCCAT CAAGCCGCGC GAGGCGATGG TGCCCATGAA GACCGACATG
GCGGGGTCGG CCGTGGCGCT CGCCACCGTG CTGGCCGCCG CCCGGGCCCA GGTGCCGCAC
CGCGTCACCG CGGTGCTCCC GCTCGCGGAG AACCACGTCG GTGCAGCCTC CTACCGGCCC
GGGGACGTCG TGACGATCCA CGGCGGCACG ACCGTCGAGA TCGCCAACAC CGACGCCGAG
GGGCGTCTGG TGCTCGCCGA CGCGCTCGCC TGGGCCGACG CGACGCTGGA GCCCGACGTG
CTCGTCGACG TCGCGACCCT CACGGGTGCC GCGACGCTCG GGCTGGGCCG TCAGCACGCC
GCGCTGTACG GCACGGACGA CGCGCTCGTC GCCGCGCTCA CGGAGGCAGG TCGGCGCACG
GGTGAGCTCG TGTGGCACAT GCCGCTGGTC GCGGACTACG AGGAGGCCGT GCGCTCGTCG
GTCGCCGACC TGCGCCACGT CCCCGAGGAC CGCAGGATCG GGGGCGGGTC GATCACGGCC
GCGCTCTTCC TGCGCCGGTT CGTCGGGCAG CGGGCCTGGG CGCACCTCGA CATCGCCGGC
ACCGGCCGCT CGACGTCGGA CAAGCACGAG GTCACCGAGG GCGCCACGGG CTACGGGGCG
CGCCTGCTGC TGGAGTACCT CGCCGCGCTG GACTGA
 
Protein sequence
MTPRSTGAAA LPPRTPPAVT LHAASVADSD LLDDGSVDAV AVQVAPGRDG DDALQPRSGT 
PQAAARYGID LAELAERAGL TGAAGEAFTV HLPLPVGSSV ELPWAGLPPR IVLVGVGDEG
PTALRRAGAA LARATRGLTR VAATVGAQTH HDEAGAAQAA RAVAEGYLLA AYRQPRTTRT
PDDERPAELV LLGRDGAAVA AAVETARTGA EATWLVRDLA NTPSSVKDPA WMADRARRLG
SRAGLDVQVL GPRELAAGGF GGILAVGAGS ASTPRLVRLT YTPAKGGGRH VVVVGKGITY
DTGGLSIKPR EAMVPMKTDM AGSAVALATV LAAARAQVPH RVTAVLPLAE NHVGAASYRP
GDVVTIHGGT TVEIANTDAE GRLVLADALA WADATLEPDV LVDVATLTGA ATLGLGRQHA
ALYGTDDALV AALTEAGRRT GELVWHMPLV ADYEEAVRSS VADLRHVPED RRIGGGSITA
ALFLRRFVGQ RAWAHLDIAG TGRSTSDKHE VTEGATGYGA RLLLEYLAAL D