Gene Cfla_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3066 
Symbol 
ID9146978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3414710 
End bp3415987 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content74% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003638148 
Protein GI296130898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000574906 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCACGT TCCGTGCCGA GGTCTTCCAG AACGAGTTCC TGCCCGACGG CGGGACCGAC 
GTCCACGCGA TCGTCACGGT CACGGCCGAG GGCGTGGGCG GCGCCGCGAC GACGGGCGGC
GGGGTCGCCG AGATCATCAT GATCGACACG TCGGGCTCGA TGACGGGCCC GATGCTCGCG
GCGGCCAAGC ACGCCGCGCA GGTCGCGGTC GACACCATCC CCGACGGCAC GTGGTTCGCC
ATCGTCAGCG GCAGCCACGT CGCGCAGCGC GTGTTCCCGT ACCCGAACGC GCCGGTCGCG
ATCGTGCAGA TGGAGCCGGG GGCACGCGAG GAGGCCAAGC GGGCCGTCGC GCGGCTGTCC
GCGCAGGGCG GCACGGCGAT GAGCACGTGG CTGCGCCTCG CCGACCAGAT CTTCGCCACC
CAGCCGGCCG CGACGCAGCG GCACGCGATC CTGCTGACCG ACGGCAAGAA CGAGTCCGAG
CCGCGAGCCC AGCTCACGTC GACGATCCAG GCCGTCACCG GACGGTTCCA GTGCGACGCG
CGCGGCGTCG GCGAACGCTG GCAGGTCGAC GAGCTGCGCG AGATCGCCAC CGCGCTGCTG
GGCGGCGTCG AGCTCATCGC CGACCCGGCC GACATCGCGA AGGACTTCCA GGCGCTGCTC
GCGACGTCCC TGTCGCGCGG CGTCGCCGAC GCGCAGCTGC GGGTGTGGAC GCCGCAGGGC
GGTCAGGTCC TGTTCGTGCG GCAGGTCGCC CCCACGGTCG AGGACCTCAC GGCCCGCCGC
ACCGAGGTGA CGCCGCTGAT CGGCGCCTAC CCGACGGGCG CGTGGGCCGA CGAGTCGCGC
GACTACCACG TGGCGGTGCG GGTGCCGTCG AAGACGGTGG GTGCCGAGCA GCTCGCGGCG
CGCGTGCAGG TCGCGGTCGC CGACGAGGTC GTCGCGTCGG GCCTGGTGAA GGCGGCGTGG
TCGGACGACG CGTCGCTCAC CGCACGCATC AGCCCCGAGG TCGCGCACTA CACCGGGCAG
GCCGAGCTCG CGTCGGCCAT CCAGGAGGGC CTGGCGGCCA AGGCCGCGGG CGACGAGGCC
ACCGCGACCG TCAAGCTGGG CCGCGCGGTG CAGCTCGCCG CCGAGACCGG CAACGAGGAG
GCGACGTCCA AGCTGCGGCG CGTCGTGGAG ATCGAGGACG AGGAGCACGG CACGGTGCGG
CTCAAGCGTG GCGCGTCCCG CCTGGACGAG ATGGCCCTGG ACACCGCCTC GACGAAGACC
TCGCGGGTGC GGCGATGA
 
Protein sequence
MATFRAEVFQ NEFLPDGGTD VHAIVTVTAE GVGGAATTGG GVAEIIMIDT SGSMTGPMLA 
AAKHAAQVAV DTIPDGTWFA IVSGSHVAQR VFPYPNAPVA IVQMEPGARE EAKRAVARLS
AQGGTAMSTW LRLADQIFAT QPAATQRHAI LLTDGKNESE PRAQLTSTIQ AVTGRFQCDA
RGVGERWQVD ELREIATALL GGVELIADPA DIAKDFQALL ATSLSRGVAD AQLRVWTPQG
GQVLFVRQVA PTVEDLTARR TEVTPLIGAY PTGAWADESR DYHVAVRVPS KTVGAEQLAA
RVQVAVADEV VASGLVKAAW SDDASLTARI SPEVAHYTGQ AELASAIQEG LAAKAAGDEA
TATVKLGRAV QLAAETGNEE ATSKLRRVVE IEDEEHGTVR LKRGASRLDE MALDTASTKT
SRVRR