Gene Francci3_2654 details


Gene Information

Locus tag: Francci3_2654
Symbol:
ID: 3906327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3133374
End bp: 3134732
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 71%
IMG OID: 637879979
Product: hypothetical protein
Protein accession: YP_481745
Protein GI: 86741345
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
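
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(3133374, 3134732)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1359 452"
```

Under these assumptions the computed values agree with the Gene Length (1359 bp) and Protein Length (452 aa) fields listed above.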


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0469483
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0556402
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGTG CGTCCACGAC CCCCCCCGAC CGCACCGGTG GCCTGCGCCT GCCGGTGACC 
GCCCCCTCCG CGGAGGGCGA GGTGGTCGAG CTCTGTCGGG AGATGCTCCG CTTCGAGTCG
GTCAACCGCG GCAACGGCGA CGGCAACGAG CGCCCGATCG CGGAGTACGT CGCGGCGAAG
CTCGCGGAGG TGGGCCTTGA GCCGACCCTG CTGGAGTCGG CCCCGGGCCG CACCAGCGTG
GTGACCCGGG TGGAGGGGGC GGATCCCTCG CGTGCGCCGC TGCTCGTCCA CGGTCATCTC
GACGTGGTGC CGGCGGATGC CAGCGAGTGG CGCCTGCCGC CGTTCGCTGG CGAGGAGGCC
GACGGCTGCC TGTGGGGTCG GGGCGCGGTG GACATGAAGG ACATGGACGC GATGACCCTC
GCGGTCATCC GCGACATCGT CCGCACCGGC CGCAGGCCAC CCCGGGACCT CGTCGTCGCC
TTCGTCGCGG ACGAGGAGGC CGGCGGCGTC CTCGGGGCGC GCTGGCTCGT CGAGAACCAT
CCCGACCTGT TTGCCGACTG CTCCGAGGCG ATCAGCGAAG TCGGTGGGTT CTCCTACACC
GTCTCCGACG ACCTGCGCCT TTACCTCATC GAGACGGCGG AGAAGGGCAT CGCCTGGATG
AAGCTGACCG CTGCCGGCCG GGCCGGCCAC GGTTCGATGA TCAGCGATGA CAACGCGGTG
ACGGCGCTGT GCGAGGCGGT GGCGAGGCTC GGCCGGCACA CCTTCCCGCT GGTCATGACC
CCGACGGTCC GGGTGTTCCT GAACTCCCTC GGGGAGGCGC TCGGCATCGA GTTCGACCTG
GACGACCTGG AGGCGACCGT CGCGAAGCTC GGCCCGATCG CCCGGATGAT CGGTGCCACG
TTGCGTAACA CCGCCAACCC GACCCAGCTC GAGGCCGGTC ACAAGGTCAA CGTGATCCCG
GGCGAGGCCA CCGCCTACGT CGATGGTCGC TATCTGCCCG GTCAGGAGGA GGAGTTCATC
CGCCAGCTCG ACGAGATCCT CGGGCCGGAC ATCCGTCGCG AATGGGTCGT CCACGACCAG
GCGTTGGAGA CCAGCTTCGA CGGCGCGCTG GTGGAGGCGA TGGCCGCGTC GCTGCGCGCC
GAGGACCCGA TTGCCCGGGC CGTGCCCTAC ATGCTCTCCG GCGGCACGGA CGCGAAGTCC
TTCTCCCGGC TCGGCATCCG CTGCTTCGGG TTCTCCCCGT TGCTGCTGCC GCCCGACCTG
GACTTCTCCG GCATGTTCCA CGGGGTGGAC GAACGGGTGC CGCTCGACTC GCTGCGCTTC
GGCGTCCGCG TCCTCGACCG CTTCCTGCAC GCCTGCTGA
 
Protein sequence
MASASTTPPD RTGGLRLPVT APSAEGEVVE LCREMLRFES VNRGNGDGNE RPIAEYVAAK 
LAEVGLEPTL LESAPGRTSV VTRVEGADPS RAPLLVHGHL DVVPADASEW RLPPFAGEEA
DGCLWGRGAV DMKDMDAMTL AVIRDIVRTG RRPPRDLVVA FVADEEAGGV LGARWLVENH
PDLFADCSEA ISEVGGFSYT VSDDLRLYLI ETAEKGIAWM KLTAAGRAGH GSMISDDNAV
TALCEAVARL GRHTFPLVMT PTVRVFLNSL GEALGIEFDL DDLEATVAKL GPIARMIGAT
LRNTANPTQL EAGHKVNVIP GEATAYVDGR YLPGQEEEFI RQLDEILGPD IRREWVVHDQ
ALETSFDGAL VEAMAASLRA EDPIARAVPY MLSGGTDAKS FSRLGIRCFG FSPLLLPPDL
DFSGMFHGVD ERVPLDSLRF GVRVLDRFLH AC
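
The relationship between the two sequence blocks can be sketched in code. This is an illustrative example only, assuming the gene sequence is pasted in with the 10-base spacing shown above; `clean`, `gc_fraction`, and `translate` are hypothetical helper names. The codon table is built from the amino-acid string of the standard genetic code, whose codon assignments translation table 11 shares (table 11 differs in the start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first 20 bases of the gene sequence above give the first six residues.
print(translate("ATGGCGAGTG CGTCCACGAC"))  # prints "MASAST"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence block, and `gc_fraction` gives a value comparable to the GC content field in the gene information.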