Gene Francci3_2377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2377 
Symbol 
ID3904584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2754597 
End bp2755910 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content65% 
IMG OID637879707 
Producthypothetical protein 
Protein accessionYP_481473 
Protein GI86741073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.169867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAC CGAACCACCT GTTCCGGGCG GCCCGCGTCC GTGTGGAGTC GCCCGAGGCA 
CCAGGCGAGC CCCTGGCCCG ACGCGAGCTA GCCGAGATGG TCAACGCCTG GATCTACGAG
CAGACCGGGC GTGAGACGGC GATCGACGGG AACTACATCG GCAAGCTGGA ACGCGGCTTG
ATCCGCTGGC CTGACGCGCT GTACCGGCAG GCGTTTCACG CCGTCCTGGC CACCAAGCAA
GATCGGGAGC TGGGCTTCCG TCGACCACGT CGCACAGAGC GGAACACTGA GGACGTGAAC
CGAAGCGAGT TCTTGCGCGC CATGGCAGGC GTAGGCGCCG CCGTCGTGAC CGGCTCCTTC
ACCGAGCTGG CCACGAGTCC CCCTCCCGAC GTGCCCGCCG TCGTGGGGCG CGAAGAGATT
GATCAACTGC GAATGGCGGC GCGTCTGTTC GGAACGGTTG ACCACACGTA CGGCGGCGGG
CTCGTACGCG AGGCGGTCGC CGGCCAGCTC CGCTACGCCG TGAATCTCCT GAACGCACGC
TGCCCGGAAC GTCTGCGCGA CGAACTGTTT ACCTCGGTCG GCTTCCTCGC GCATACCGCA
GCGTTCATGG CGTTCGACGC CTACGTCCAC GAGGATGCTC GAAGAATGTT CTCTCTCGCG
CTCGCGTGTG CCGAGGAGGC GGGTGACTGG CACCTGAGAG CAAAAGCGCT TTCGTCGATG
GCCAGGCAGG CTATCTGGCG AGGCGACCCT GATATGGGCC TCACGCTGGT AGACGTCGCT
TTCATTCGTG CCGAACGCCT TACGCCCACC GAACGCGCCA TGCTCCTTGC CGCCAAGGCA
CGTGCGCTCG CAAAGCTGGG TCGGGAGCAG GAGACGCTAC GGATCATCGG CCAGGCCGAC
GACGAGTTTA GCCAGTCCGA GCCGGAAAAC GATCCGGTCT GGATGGCCTA CTACGACCTC
GCTCAACACG TCGGAGACAC CGGTCACGCC GTTTTCGATC TGGCGGTCAG AAACGCCAGG
CACGCGCCGG AGGCTGCGCG GCGTCTGGAC GGTGCGGTCA CTGGACACAG CGCCACGTTC
GCCCGGTCCA TAGCCATCAG CCAGACAAAA CTCGCGTCGC TCATGATGAA GACCGGCGAC
CCGCAAGAAG CGGCCATCGT CGGCTCCGCG GCACTCCGCG CCGCAGGACA AGTACATTCT
CGACGGGCGG CAGATGACAT ACGCGAGCTA GGAACATTCG CGAAACGACA CGAACGGATC
GATGAAGTCG CGGAACTCAG GCATAACATT CAGACGGTCA CCAGCTCCGC ATGA
 
Protein sequence
MAEPNHLFRA ARVRVESPEA PGEPLARREL AEMVNAWIYE QTGRETAIDG NYIGKLERGL 
IRWPDALYRQ AFHAVLATKQ DRELGFRRPR RTERNTEDVN RSEFLRAMAG VGAAVVTGSF
TELATSPPPD VPAVVGREEI DQLRMAARLF GTVDHTYGGG LVREAVAGQL RYAVNLLNAR
CPERLRDELF TSVGFLAHTA AFMAFDAYVH EDARRMFSLA LACAEEAGDW HLRAKALSSM
ARQAIWRGDP DMGLTLVDVA FIRAERLTPT ERAMLLAAKA RALAKLGREQ ETLRIIGQAD
DEFSQSEPEN DPVWMAYYDL AQHVGDTGHA VFDLAVRNAR HAPEAARRLD GAVTGHSATF
ARSIAISQTK LASLMMKTGD PQEAAIVGSA ALRAAGQVHS RRAADDIREL GTFAKRHERI
DEVAELRHNI QTVTSSA