Gene Cfla_1820 details


Gene Information

Locus tag: Cfla_1820
Symbol:
ID: 9145713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2029831
End bp: 2031036
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: aminodeoxychorismate lyase
Protein accession: YP_003636916
Protein GI: 296129666
COG category:
COG ID:
TIGRFAM ID:
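The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above
gene_length, protein_length = cds_lengths(2029831, 2031036)
print(gene_length, protein_length)  # 1206 401
```

With the values from this record, the result agrees with the listed Gene Length (1206 bp) and Protein Length (401 aa).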


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.111847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.624649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAACC AGACGGAGTG GTGGGCCCCG GTGAAGTCCG ACGAGAGCGT GACCGACCTG 
TTCGGCGGTG AGCGCGTCGC GGCGGCGCCG GGTGCGCCCG AACCGCGACG ACGGTCCCGC
TCGTCGGGCC GCAAGCGTGA GGAGCGGATG CGCAAGCAGC GCCGCCGCCG GTCGGTCTCC
GTGCTCGTCG TCGCGCTCGT GCTCGTCGCC GGCGCCGGAT ACGTCGTCTT CTCGCTGCTG
GGAGGCCAGC TGTTCGCCGG GTCCGGCCAG GAGCGCGTGA CCGACTACCC CGGTGCCGGG
CGCCCCGGGG CGCCCACGAT CGTCATCAAC GCCGGTGACA CCGGCGCAGC GATCGCTGCG
ACGTTGTACG ACGCCGGCAT CGTCGCGTCC GAGGCGGCGT TCCGCGAGGC GTTCGACGCC
AACCCCGACG CGGCCGGTAT CCAGCCGGGG ACCTACCAGC TCAACCTCGA GATGAACGCC
GAGCGTGCGG TGCTGGCGCT GCTCGACCCG AAGAGCCGCA AGTCCATGAA GCTCACGATC
CCCGAGGGCT GGACGGCCGA CGAGATCTTC GCGCGCATCA ACGAGGTGAC GCTCGTCCCG
GTCGAGGAGC TCAAGGCTGC GGCGTCCGAC CCTGCCGCGA TCGGACTGCC CGCCGAGGCG
GGAGGCAACC TCGAGGGCTG GCTCTTCCCG ACGACCTACC AGGTCGAGCC GAACCCGACG
GCGCAGTCCG TGATCGCGCC GATGGTGGCC AAGACCGTCG AGACGCTGAC GTCGAAGGGC
GTCCCCCAGG ACCAGTGGCT CGACGTCCTG AAGAAGGCGT CGCTCATCGA GAAGGAAGCG
GTCCTCGACA GCGACCGGCC GATGATGGCC CGCGTCATCG AGAACCGGCT CGCGCAGGGC
TGGCCCCTGC AGATCGATGC GACGCTCGTC TACGCCCTCA AGAAGCCCGG CAACGAGCTG
ACGCAGGCCG AGCTCGAGGA CACGTCGAAC CCGTACAACT CCCGCAAGCT CAAGGGGCTC
CCCCCGACGC CGATCGCGTC GCCGGGCATC CCCTCGATCG AGGCGGCGCT GGCACCCGCG
GCCGGGGACT GGATGTTCTG GGTGACGGTG AACCTCGAGA CCAGCGAGAC GAAGTTCGCC
ACGACCCACG ACGAGTTCCT CGAGTACAAG GCCGAGTACC AGGCGTGGGT GGAGGAGAAC
CGCTAG
 
Protein sequence
MSNQTEWWAP VKSDESVTDL FGGERVAAAP GAPEPRRRSR SSGRKREERM RKQRRRRSVS 
VLVVALVLVA GAGYVVFSLL GGQLFAGSGQ ERVTDYPGAG RPGAPTIVIN AGDTGAAIAA
TLYDAGIVAS EAAFREAFDA NPDAAGIQPG TYQLNLEMNA ERAVLALLDP KSRKSMKLTI
PEGWTADEIF ARINEVTLVP VEELKAAASD PAAIGLPAEA GGNLEGWLFP TTYQVEPNPT
AQSVIAPMVA KTVETLTSKG VPQDQWLDVL KKASLIEKEA VLDSDRPMMA RVIENRLAQG
WPLQIDATLV YALKKPGNEL TQAELEDTSN PYNSRKLKGL PPTPIASPGI PSIEAALAPA
AGDWMFWVTV NLETSETKFA TTHDEFLEYK AEYQAWVEEN R
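
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below is a minimal, dependency-free illustration of translating such a sequence and computing its GC fraction. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the code is not the method used to produce this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# assigns the same amino acids as the standard code; it differs in which
# codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    A permitted start codon in the first position is read as M.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above
first_line = "GTGAGCAACC AGACGGAGTG GTGGGCCCCG GTGAAGTCCG ACGAGAGCGT GACCGACCTG"
print(translate(first_line))  # MSNQTEWWAPVKSDESVTDL
```

The 20 residues obtained from the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the Protein Length and GC content fields to be cross-checked.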