Gene Francci3_3118 details

Gene Information

Locus tag: Francci3_3118
Symbol:
ID: 3904244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3690085
End bp: 3691395
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 72%
IMG OID: 637880439
Product: aminotransferase, class V
Protein accession: YP_482204
Protein GI: 86741804
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
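
The length fields in this record can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3690085
END_BP = 3691395


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1311 436
```

Both values agree with the Gene Length and Protein Length fields listed in the record.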


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.646431
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCCCC GCCGATCCAA CCGGACGACA TGCTGTCTTC GACACGCTGT AGATGGTGCG 
GCAGCAGCGT ATCTCAGCAG CCGCGAACCT TCTCGCCTAA GGCCCCCGTC AGCGCCTGTG
ACGCACGCCC CTCCCGGCCG TTCACGGCGT ACCGTTTCAC AGGTGCCGGC CTATCTCGAC
CACGCGTCGA CCACGCCGCT GCACCCTGCC GCGCGCGAGG CTCTGCTAAT GGCTCTTCAG
GATGGTTGGG CCGACCCGGC ACGGTTGTAT CGGGAGGGCC GCCGGGCGCG CATGCTGCTC
GACGCGGCCC GCGAGACCGT CGCGGGCGTG CTCGGGGCCC GGCCGGACGA GATCAGTTTT
CCCGCGAGCG GGTCGGCGGC GGCGCATCTG GCCCTGTTGG GCACGGCGGC GGCCCGGCGG
CGTGCGGGTG ACGTCGTCAT GGTCAGCGCG GTCGAGCACT CCAGCGTCCT GCACGCCGCG
CAGCGGCACG AACAGGCCGG TGGACGGGTC GTCAGGATTG GTGTGGATCA TCTCGGCCGG
GTCGACCCCG CCGACTTCAC CCCCGTCGCC GGTACCGCCG TCGCCAGCCT CCAGCACGCC
AACCACGAGG TCGGGACCAT CCAGCCGGTC GCCGAGGTCG CCGAACGGAT GCGCGCCGCC
GGGGTGCCGC TGCACACCGA CGCCGCCGTG ACAGTCGGCC ACATCCCCGT CGACCTGGCC
GACCTCGGGG TGGATCTGCT CACCGCGAGT GCCCACAAGT TCGGCGGACC ACCCGGGGTG
GGCGTTCTCG CGGTGCGCAC CGGGACCCGC TGGCGCAGTC CCGGTCCCGT TGACGAGCGG
GAGGGCGGTC GGGTTGCGGG CTATCCGAAC GTCCCCGCCG TCGTCGCCGC AGCCATGGCG
CTGAGCGCCC GGGCGGGTGA ACTCGCCGCG GAGGCGCCCC GGCTCGCCGG CTACGTGGCC
GAACTGCGCC GCCGGCTGCC CGAACTCGTC AACGGCGTGG AACTACTCGG CGATCCGGAC
CGGGCGGCGA CGGTTCCGCA CATCAGCGCG TTCTCCTGCC TCTACGTCGA GGGCGAGGCG
CTGCTGACCG AGCTCGACCG GACCGGAATC GCCGTGAGCT CCGGGTCGAG CTGCACATCC
GACACCCTGA TCCCGAGCCA TGTCCTGGTC GCCATGGGCG CGTTGACGCA CGGCAACCTG
CGGATATCCT TTGGCCGCGA GTCGACCCAG GCCGATCTCG ACGCCCTGCT GACGGCGCTG
CCCGCAGCCG TACGCGCCGT GCGGGACCGG CTGGGCGCCG CCGGGCTTTG A
 
Protein sequence
MSPRRSNRTT CCLRHAVDGA AAAYLSSREP SRLRPPSAPV THAPPGRSRR TVSQVPAYLD 
HASTTPLHPA AREALLMALQ DGWADPARLY REGRRARMLL DAARETVAGV LGARPDEISF
PASGSAAAHL ALLGTAAARR RAGDVVMVSA VEHSSVLHAA QRHEQAGGRV VRIGVDHLGR
VDPADFTPVA GTAVASLQHA NHEVGTIQPV AEVAERMRAA GVPLHTDAAV TVGHIPVDLA
DLGVDLLTAS AHKFGGPPGV GVLAVRTGTR WRSPGPVDER EGGRVAGYPN VPAVVAAAMA
LSARAGELAA EAPRLAGYVA ELRRRLPELV NGVELLGDPD RAATVPHISA FSCLYVEGEA
LLTELDRTGI AVSSGSSCTS DTLIPSHVLV AMGALTHGNL RISFGRESTQ ADLDALLTAL
PAAVRAVRDR LGAAGL
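
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is read in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The helper names `translate` and `gc_percent` are hypothetical, and only the first 30 and last 51 bases of the gene sequence are used here.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are shared by
# the standard table and table 11 (bacterial, archaeal, plant plastid).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    dna = clean(seq)
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last six 10-base groups of the gene sequence above.
head = "ATGTCGCCCC GCCGATCCAA CCGGACGACA"
tail = "CCCGCAGCCG TACGCGCCGT GCGGGACCGG CTGGGCGCCG CCGGGCTTTG A"

print(translate(head))  # MSPRRSNRTT
print(translate(tail))  # PAAVRAVRDRLGAAGL*
```

The two outputs match the beginning and end of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.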