Gene Francci3_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1784 
Symbol 
ID3904014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2121348 
End bp2122412 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content74% 
IMG OID637879122 
Producthypothetical protein 
Protein accessionYP_480889 
Protein GI86740489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00707471 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAAGGTC ACCCGGTACG CGGTCGAACC GGTCGGGTCA TGCTGACCCG CAGCGTCCTC 
ATCGGCGCCT GCCTGCTGGG CTTCCTCGCG ACCACAGCAC CCGCCTTCGC CGACAGCTAC
GCCACCGCGG ACTGCGCGCA GAACCCCCAC CCCGGCTGCG AGGTAAGCGC CGGCACGAAC
GGCACCGCGC CCACACCACC CCGCCGTGAC GTCCGGCCCG GGCCGCCGGG CGGCACCTCG
ACCGGCCGCG GCGGCGGGGA GAAGAGCGCA CGTCCGCCCG GTGACCTCTC ACTCGACCCC
TCGGAACTCG CGGGCTGCGC CTACACGCGC AGCGACTTCC AGCCCGAGGG CGACCCGATC
CGGCCGGTAG CCTTCCGCCC AGCTCCACAG GGCAGCCGAC CGCGCGTCGT CGCCGCCCTG
GACCGCCTCG GCGCGCCACA GGTGCAACCC GTCGCCACCG CGGCAGACGG GCAGCCCGGA
GCCTGGTACG TATACCAGTG CCAGGGCGAC GGATGGCACG ACGCGCTCTA CCGGCCACCG
GTATGGATCG CGGACGGGCA GGCCGGCCCC ACAGCCACCG CACCCGACCC GGCCACCCTC
GCCGAGCAGG CCCGCAACCA ACTGCGCCTC CAAGGCCCCG CCATCGCCTT CAGCCCAACC
AGACGGCAGC TCGTGCGACT ACCAACCTGG ATGTGGCTCG ACCCCGCCAG CTGGCGACCC
GTCTCCGCCA CCGCCGCCGC CGGCGGAGTC TCGGTGCAAG CGGTCGCCAC CCCCGCGCAG
GTCGTATGGT CCATGGGCGA CGGCACCGAC GTGCGCTGCA CCGGCCCCGG CACCCCCTAC
CAGCCCACCG TCGACCCGAC GGCCGCCTCC CCCGACTGCG GCCACACCTA CACCACCGAC
TCCGAAGACG AACCCGGCGG CGTCTTCCCC GTGTCCGCCA CCGTCACCTG GAACGTCACC
TGGGCCGGCG GCGGCCAGAA CGGCACCTTC AACGGGCTCA CCACCATCTC CACCGCCCAG
GTCAGCGTCA TCTCCGTGCC CGCACTGACC ACCGGCGGAG GCTGA
 
Protein sequence
MEGHPVRGRT GRVMLTRSVL IGACLLGFLA TTAPAFADSY ATADCAQNPH PGCEVSAGTN 
GTAPTPPRRD VRPGPPGGTS TGRGGGEKSA RPPGDLSLDP SELAGCAYTR SDFQPEGDPI
RPVAFRPAPQ GSRPRVVAAL DRLGAPQVQP VATAADGQPG AWYVYQCQGD GWHDALYRPP
VWIADGQAGP TATAPDPATL AEQARNQLRL QGPAIAFSPT RRQLVRLPTW MWLDPASWRP
VSATAAAGGV SVQAVATPAQ VVWSMGDGTD VRCTGPGTPY QPTVDPTAAS PDCGHTYTTD
SEDEPGGVFP VSATVTWNVT WAGGGQNGTF NGLTTISTAQ VSVISVPALT TGGG