Gene Francci3_3165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3165 
Symbol 
ID3903887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3744206 
End bp3745537 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content73% 
IMG OID637880486 
Producthypothetical protein 
Protein accessionYP_482251 
Protein GI86741851 
COG category[S] Function unknown 
COG ID[COG4198] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.30892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTCA TGGTTGACGC CGATCGTCCC GTTCCCGCTC CGGCGGGTCT CGTGCTCGCC 
CCGTTCCGGG CCGCACGATT TCCGTCCTCC GGGCCGGACC TCGCCCCGCT GACCTCCCCG
CCCTACGACG TCATCGACGA CGCCGAACGG GCGGAGCTCC AAGCGCGCGA CGAGCGCAAC
GTGGTCCGGC TCATCCTCCC CGGGGAGGAC TACGACGGGG CCGCCCGCAC GCTGCGGGCA
TGGCTGGACA GCGGAGTGCT GCGTCGCGAC GAGAAGGCCT CCGTCTACGT CTACGAGGAG
GAGCGGGCCG GCCACGCCCA GCGCGGGCTG ATCGGGGCGG TCGCGCTGAC CGATCCGGAT
GCGGGGATCA TCCTCCCGCA CGAGAACACC ATGGCGGGCC CGGTCTCAGA CCGGTTGGCG
CTGACCCGCG CGACCCGCGC GAACCTGGAA CCGATCTTCC TGCTCTACGC CGGCGGCGGC
GAGACCAGCC GGGTGGTCTC GATGGTGATC GCCACGACGC CGCTGGTGGA GACGTCCACG
GACGACGGGG TGACGCACCG GCTCTGGGCC ATCGACGATC CGGCGGTCCT CACCGCCATC
GCCGCGGACC TGTTGCCCCG GCGCGCGGTG ATCGCGGACG GCCACCATCG GTACGCCACC
TACCGCCAGT ACCAGGCGGA ACGGCACGCC GCCGGGGATG GTTCGGGCCC CTGGGACTTC
GGTCTGGCCT TCCTCGTCGA CGCGACCGTC TCGGGGCCCC AGGTGCACGC CATCCACCGG
GTAGTGCGTG GTCTCGGGCT CACCGAGGCG GTGCGGCGGG CCGCCGAGGT GTTCACCGTG
CGTCAGCTCG CCGGGCCCGG CGAGGGTGGT ACCGCCGCCG GGGATGCCGG TGGCGTGGGG
CCGGCAGGCG CGGACCCGGA CGCGCTGGTG GAGGAACTGG CCAAGGCCGG GCAGGGCGGG
CACGCGTTCG TGGTCACCAA CGGCACCGCG GCCTACCTGC TCACCGAGCC CGACGCCGAC
CTGCTCACCC GTAGTCTGCC CCCCGAACGG TCGGCGGCCT TCCGTGGACT CGACGTCACC
GTCGCTCATC TTGCGTTGAT CGTGGACGTC TGGGGGTTGA CGGACACGGT GGGCGTGGTC
GACTACCACC ACGACGCGCC GGCCGCGATC GCCGCGGCGG CTGCGGCGGG AGGTACCGCG
CTGCTGCTCA ACCCCACCCC GATCGCCGGT GTGACGGCCG TCGCCGAGGC CGGCGAGCGG
ATGCCGCGCA AGTCGACGTT GTTCACCCCG AAGCCGCGCA CCGGACTCGT GCTGCGTCCA
CTCGACGACT GA
 
Protein sequence
MRLMVDADRP VPAPAGLVLA PFRAARFPSS GPDLAPLTSP PYDVIDDAER AELQARDERN 
VVRLILPGED YDGAARTLRA WLDSGVLRRD EKASVYVYEE ERAGHAQRGL IGAVALTDPD
AGIILPHENT MAGPVSDRLA LTRATRANLE PIFLLYAGGG ETSRVVSMVI ATTPLVETST
DDGVTHRLWA IDDPAVLTAI AADLLPRRAV IADGHHRYAT YRQYQAERHA AGDGSGPWDF
GLAFLVDATV SGPQVHAIHR VVRGLGLTEA VRRAAEVFTV RQLAGPGEGG TAAGDAGGVG
PAGADPDALV EELAKAGQGG HAFVVTNGTA AYLLTEPDAD LLTRSLPPER SAAFRGLDVT
VAHLALIVDV WGLTDTVGVV DYHHDAPAAI AAAAAAGGTA LLLNPTPIAG VTAVAEAGER
MPRKSTLFTP KPRTGLVLRP LDD