Gene Francci3_2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2698 
Symbol 
ID3904922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3183091 
End bp3184635 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content73% 
IMG OID637880022 
Producthypothetical protein 
Protein accessionYP_481788 
Protein GI86741388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.570232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCGGG TGGCGCGGGT GAGTCTGCGG GTGGAGCAGA ACGAGTTGCG TGCGCGGATG 
CGGGCGGCCG GGATGACGCA CGAGGAAGCC GCGGTCGAGT TCGCCCGCCG CTACCGGCTC
CGCCCTCGGG CGGCGTTCCG GCATGCGTTC GGATGGACGT TGCAGGAAGC CGCCAACCAG
ATCAACACTC ATGCCACCCG TACCGGCTTC GATCCAGATG GCATACCGGT GATGACCGCT
CCGCGGTTGA GCGAGGTGGA GAACTGGCCG CGCCCCGACC GGCGACGGCT AACTCCGCAG
GTGCTCGCGC TACTCGCCGT TGTGTACGGC ACCGATGTTC ATCGCCTCCT CGATCTTGAG
GATCGGGAGC GGCTGAGCCC GCAGGACCGG CTCCTCCTCC ATCGCATGCA GCGGAACACG
GTGGATTCCG CGCCTCGGGG GCGCAGGAGC ACGCAGTCTG CCGGGACTGT GGGGGCGGTT
CGGACCCGGC CGATGGCCTA CGACCTTTTC CGCCACGAGC AGGGCAACCG CGGCGCGCCA
CAGCCCATGG CCTCCGCCGG AGAGACCGGG TTTCTCGACG GTGACGAGCA GGAGCGGCTA
CGCCGTGCGG TGTCGCGGCC GAGCCGTGTG GATGGTCGGG TGGTGGCGTC GTTGGCGGCG
ATCCTGGCCG AGCAGCGGGC GACCGAGGAC CTGATCGGTT CGGCCCGGCT ACTGGTCCCG
GTCATGGCGC AGTTGGGTGA GGTGGAGCGG CTGATCGGCG AGGCGTCGGG GCAGGTGCGG
GGGCCGCTGG TGGAGATCGG GGCGCAGTGG GCGGAGTTCG CCGGCTGGCT GCACATCTCC
ACCGGCCGGT GGGCGGCGGC CCGTGGCTGG CTGGACCGGG CCGCGGAGTG GGCGTTCGAG
GTGGACGCGA CCACCCTGCA CGCGACCACA ATCAGCTTCA AGGGTCATCT GGCGTTCCAC
CTCGGCCAGC TCGACGCCGC GGTGGGCCTG TCGCGGGCGG CGTCGCGGGA CGAGCGGGTG
TGGGTGGGCC AGCGGGCCTA CGACGCCCAC CAGGAAGCCC GCGCCCACGC GCTCGCGGGC
CGCCGCCGGC CGGCGGTCGA GGCGCTGGCT CGGGGCGCCG ATCTCGCTGC CGCCGCAGCG
GCGGACGGCG AGGCGGCCCC GGCGTGGATC TACTACTACA CGCCGGAGTT CTACGCGTTG
GAACGCGGCT GGGTCTGCCG CTACCTCGGC CGCGACGACC CGGCCTCCAA CGAGGAGGCG
ATCGCCTGCC TCACCCGCGG ACTCGCCGGC CTCGGCGACG CCCGCACCTC CGGCTGGGCG
GCGGAGTTCC TCTGCCACCT AGCCGCCGCC TACCTCCAGG CCGACAGCCC CGACCTCGCC
GGCACAGCCG GTATCGAGGC GGCGACGATC ATCGCCGCGA CGGGGTCCGT CCGGCTTCTG
CCACGGCTGC GGCGGTTGCA CGCCGACCTG GCCGCACGCT GGCCGACAAG CCCCACCACA
GCTGATCTGG GTGAGGCCCT TCGGCTCGGC CGGGACGACG GGTAG
 
Protein sequence
MGRVARVSLR VEQNELRARM RAAGMTHEEA AVEFARRYRL RPRAAFRHAF GWTLQEAANQ 
INTHATRTGF DPDGIPVMTA PRLSEVENWP RPDRRRLTPQ VLALLAVVYG TDVHRLLDLE
DRERLSPQDR LLLHRMQRNT VDSAPRGRRS TQSAGTVGAV RTRPMAYDLF RHEQGNRGAP
QPMASAGETG FLDGDEQERL RRAVSRPSRV DGRVVASLAA ILAEQRATED LIGSARLLVP
VMAQLGEVER LIGEASGQVR GPLVEIGAQW AEFAGWLHIS TGRWAAARGW LDRAAEWAFE
VDATTLHATT ISFKGHLAFH LGQLDAAVGL SRAASRDERV WVGQRAYDAH QEARAHALAG
RRRPAVEALA RGADLAAAAA ADGEAAPAWI YYYTPEFYAL ERGWVCRYLG RDDPASNEEA
IACLTRGLAG LGDARTSGWA AEFLCHLAAA YLQADSPDLA GTAGIEAATI IAATGSVRLL
PRLRRLHADL AARWPTSPTT ADLGEALRLG RDDG