Gene Francci3_3995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3995 
Symbol 
ID3906956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4779225 
End bp4780832 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content71% 
IMG OID637881324 
Producthypothetical protein 
Protein accessionYP_483074 
Protein GI86742674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGTG GCGTCATGAG CAAGCTCAGC CGGCGGGTGG AGCAGGAGGA GTTGCGGGCC 
CGGATGCGCG CGGTGGGTAT GTCCCACGAC GAGATCGCTG TCGAGTTCGC CCGCCGCTAC
AAGTTGCGGC CCCGTGCCGC CCACCGCCAC GCCCGCGGCT GGACCCAGAC GCAGGCCGCC
AACCACATCA ACACCCACGC CGCCCGCGTC GGCCTCGACC CGGACGGCGC CGCACCCATG
ACCGGCCCGA AGCTGTCGGA GCTGGAGAAC TGGCCGTTGC CGAACAACCG CCGCCGGCCC
ACCCCCCAGA TCCTCGCCCT GCTCGCCGAG GTCTACGACA CCAGCATCCA CAACCTGATC
GACCTTGACG ACCGCGAACA GATGCCTCCT GCTGATCTGC TGCTCATCAC CACGATCCGT
GAGAGGGCCG TGCCGGCGAG CGCTATCGGT TCGCTCCCAC AAGCGCAGTC GGCAAGCTCG
GAGGTCACGC CGATGGTGGA TGCGCCGAAT CGGCAGCAGT TCCTGCTCGC GGCGTCAGCC
TTGGGCGTTG CCGTGGTGCT GCCCCGACAG CCGGCCGCGC CGTCCCCGTC CGTGAGCGTC
CGGTCGACCG TGTTCCCGGC CGCATGCGAT CTTCTCGCTG ATCTGCGAGA AGCCATCACG
GCACCGGCGG AGTGGTCCAC CGATCCCGAC CCGGCCTCAT TCGCGGGCCC TGCCGACCTT
GACGCTCGCG CGCGGGAGTG CCACGACCGC TACCAGCGGG CCGACTACGC AGGCACGGCG
AGACTTCTCC CCGCGGTGGT GCGGGGCATC GAGACACTCA CGGTCGATCC GCCGACTGGC
GTGAACCACC GCGCGGTCCG GCGGACGCAG GCCGTCGCGT ACATCGCTGC CGCCAAGCTC
GCGACCAAGA CCGGCGACCA CGATCTCGCC TGGCTGGCCG CAGACCGTGG CCAACACGCG
GCGCTCGCTG CCGACGCGCC AGCGCTACTG GCGACAGCGC GCAGACAGAT CGCCTGCGTC
TTCCACGACA CGGGACGGCT GGCCGACGCC GAACGGGTCG CGGTCAGCGC CCTCGACGCC
CTGAACCAGC GACCAGGCGA CGAGGACCAC CGCGACCTCT CGTCCGCGCG GGGCGCTCTT
CTTCTGCTCT CGGCAATGAC CTCGATCCGC CGAGGCGAAC GGACGGAAGG CCGCCGCCGG
CTCACCGCCG CGGCCGAGCA GGCTGACGCG CTTGGCCGGG ACAACAACCG GCTGTGGTCG
GCGTTCGGGC CGACGAACGT CGCGATCCAC ACCCTTACCG CCACCCTGGT ACTGGACGAT
CCGACAGAGG CGGTCGGCGT CGGCGAGCAG ATCGACACAC GTCTGCTGCC ACCCCCGCTG
GCCGGCAGGC GCGCACGTCT GCACGTAGAT CTTTCCGGCG GGCATGCCCG CCTGGGCGAG
GATGCCATCG CGGCGGTGCA CATCCTTGAC GTCGCCCGCC GGGCGCCGCA GCTGCTGAGG
GTTGATCCGA CAGCTCGGGC TGTGCTGGCG ACACTGCTCG GCCGTGCCCG CGGCTCCACC
GTCTCGGTCC TACGGAGTGT CGCGGAGCAG GCCGGAGTCG CAACGTGA
 
Protein sequence
MCGGVMSKLS RRVEQEELRA RMRAVGMSHD EIAVEFARRY KLRPRAAHRH ARGWTQTQAA 
NHINTHAARV GLDPDGAAPM TGPKLSELEN WPLPNNRRRP TPQILALLAE VYDTSIHNLI
DLDDREQMPP ADLLLITTIR ERAVPASAIG SLPQAQSASS EVTPMVDAPN RQQFLLAASA
LGVAVVLPRQ PAAPSPSVSV RSTVFPAACD LLADLREAIT APAEWSTDPD PASFAGPADL
DARARECHDR YQRADYAGTA RLLPAVVRGI ETLTVDPPTG VNHRAVRRTQ AVAYIAAAKL
ATKTGDHDLA WLAADRGQHA ALAADAPALL ATARRQIACV FHDTGRLADA ERVAVSALDA
LNQRPGDEDH RDLSSARGAL LLLSAMTSIR RGERTEGRRR LTAAAEQADA LGRDNNRLWS
AFGPTNVAIH TLTATLVLDD PTEAVGVGEQ IDTRLLPPPL AGRRARLHVD LSGGHARLGE
DAIAAVHILD VARRAPQLLR VDPTARAVLA TLLGRARGST VSVLRSVAEQ AGVAT