Gene Francci3_1434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1434 
Symbol 
ID3903165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1726274 
End bp1728592 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content72% 
IMG OID637878771 
Producthypothetical protein 
Protein accessionYP_480540 
Protein GI86740140 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.709183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.106581 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGGG ACGTACCGGC CGACGAACAC GCGCTCATCA TCTCGGCCGG AGGGGAGACG 
GGCCGAGCAC TCCGTGCGGC GTTCGCGGCG ATCGCACCCA CCGACGCGTC CTCGGATGAC
GGGGTAGACC AGACCGCCGA GGGCGACGGC TCGCCAGCCG ATCCCACCCT GGCCAACGAA
CGACGGCCGC TGCACCGCTA CGGCAGGGTC CACGTGTTGG GCCAGCCTCG GGGCACCTCC
ACGGCGACGG CGGAGCGCAC CGCCGCCCGG CTGGCGGCGG CGGTACTCCC CCCCGCGGAC
GGCAGCGTCG TGTCGACCAC CGGGGTGTCG CCCGGGCTGA ACCGCGCCGA GACGCTCGGG
CTGGACGCCT TCCGCCTGCG GGCCAGCGCC GCCTACCGGC AGCTCAAGCA GGAACGTCCC
CGCGACGGCC GGCCATGGGA CATGGACCGG CCCTGCACCG ACATCCCGCC GCCCCGGGGC
AGCTCGCCTG GTCCGGACCA GCCGACACCA CATGCCCAGC CGACACCGCA TGCCCAGCCG
ACACCACATG CCCAGCCGAC ACCGCATGCC CAGCCGACAC CGCATGCCCA GCCGACACCG
CATGCCCAGC CGACACCGCA TGCCCAGCCG AACGGGCGGG CCGAGGCCCA GAGCCCGACC
TCCGCGCCCA CGGACACCCG GGCCACGGAC ACCCGGGCCA CCGCGCAGAT CGCCCAGGCG
TCCTCCCCCA CCAGCGCGGC GCTGGAGGGA TCGGTGGCCG TCGGGCTGAT CCTCGTGGAA
GGGCCGACCA CCGCGCTGCA ACTGACCGCC GCGGAACGCA CCAAGATCGT AGCTGAGGTG
CAGAACGGGT TGTCCTGGTA CGCAACCACG AACCCGGCTG CCGACCTGAC CTTCCACTAC
GACATCCAGA TCGTGCGGCT GTCCGTCCCA CCGGACCCGA ACGCGCCCGA CCTCGAGGCG
CTGTGGCGTG ACCCCACCAT GAGCCGGCTC GGCTACGCGG CCAGCTTCGA CGGCGTCTAC
GACTACGTCG ACGCGCTCCG GTCCCGGCTC GGCACCCGGT CCGCCTACTG CGCGTTCTTC
ACCAAGTATC CGCTGGGATA CTTCGCGTAC TCCTCGGTGG GTGGCCCCCG GCTCGTGATG
TCCGTCGACA ACGACGGCTG GGGACCGGAC AATATCGACC GGGTCTTCAC CCACGAGACC
GGCCACATCT TCGGCGCTCC GGACGAGTAC GCGGGCGCCC AGTGCGACTG CGGCGGTCGG
TGGGGGGCGT TCCACGCCCC GAACGGCAAC TGCGACGCCT GCGCGCCCGC GCCCGTCGAC
TGCCTGATGC GCTCGAACAG TTTCGCGCTG TGCCGCTACA CCCCCAGCCA CATCGGCTGG
GGCCACGGAG TAAGCGGCAA CCCGGCGCTC GTCCAGGCCA GGGGGCTCGG CCAGATCGGC
AACTTCGACG CCGTCGTCCC GTCGGCCTTC GCCGGGCTGA CCCACGTCTG GCGGGACAAC
GACGCGGCTG GCTTCCCCTG GATGGCCCCG TGGCAGACGG CTCAGGAGCT CGGCCGGATC
GATGCCGCCA CCATGATCCA GAGCACCCTG GCCAGGCCGG GACCCCTCGA GGTCGCCGTC
CGGGTCGGCT CGACGCTGTA CTTCCTGTGG CGGGACTCCA CCGGCGCCTT CGCCTGGCAC
CCGCCGACCC GCCTCGTCCA GGGGGTGGGG GGCGTTCCGT CGCTGGTGCA GAGCCGGCTG
GGCACCAAGG GCAACTTCGA ACTGCTCGTG CCCGCCGCGG ACGTCGGCAT CCTGCACCTG
TGGCGCAACC ATGACATCCA CGGATTTCCG TGGAGCACTC CGAAGCTGTT CGGCGCGAAC
CTCGGGCGCG TCGACGCCGT CAGCCTCATC CACGGCACGC TGGGCGGCGG CAACGGGATG
CTGGAAGCGG TCGCCCGGGT CGGCAACCGG CTCGTGCACC TCACCCGCGA CAACGGGGCG
GTCTGGCGCA CCGGCCCCGT CTTCGCCGAG GGCGTGACGG GCAATCCGGC GCTCATCCAG
AGCGCCTTCC CCGACGGCTC CCGCAACTTC GAGGTGGTCG TCCCCGCCGC GGACCGCGGG
CTCATCCACT TCTACCGGAA CAACGGCGCG CCCTCCCCGG GCTGGAGCGG ACCGCGGCCG
TTCGCACCGG AGCTGGGCCG GGTGGACGCC GTCTCGATGA TCCAGAGCAA CTTCGACGGG
CATCTGGAGG TGCTCGCCCG CGTCGGCGAC CGGCTCCACA TGGTCTGGCG TTCGTCGGGC
CCCGGTGCAA GCTGGTCGGT CCCCCGGCGC GTGTTCTGA
 
Protein sequence
MAGDVPADEH ALIISAGGET GRALRAAFAA IAPTDASSDD GVDQTAEGDG SPADPTLANE 
RRPLHRYGRV HVLGQPRGTS TATAERTAAR LAAAVLPPAD GSVVSTTGVS PGLNRAETLG
LDAFRLRASA AYRQLKQERP RDGRPWDMDR PCTDIPPPRG SSPGPDQPTP HAQPTPHAQP
TPHAQPTPHA QPTPHAQPTP HAQPTPHAQP NGRAEAQSPT SAPTDTRATD TRATAQIAQA
SSPTSAALEG SVAVGLILVE GPTTALQLTA AERTKIVAEV QNGLSWYATT NPAADLTFHY
DIQIVRLSVP PDPNAPDLEA LWRDPTMSRL GYAASFDGVY DYVDALRSRL GTRSAYCAFF
TKYPLGYFAY SSVGGPRLVM SVDNDGWGPD NIDRVFTHET GHIFGAPDEY AGAQCDCGGR
WGAFHAPNGN CDACAPAPVD CLMRSNSFAL CRYTPSHIGW GHGVSGNPAL VQARGLGQIG
NFDAVVPSAF AGLTHVWRDN DAAGFPWMAP WQTAQELGRI DAATMIQSTL ARPGPLEVAV
RVGSTLYFLW RDSTGAFAWH PPTRLVQGVG GVPSLVQSRL GTKGNFELLV PAADVGILHL
WRNHDIHGFP WSTPKLFGAN LGRVDAVSLI HGTLGGGNGM LEAVARVGNR LVHLTRDNGA
VWRTGPVFAE GVTGNPALIQ SAFPDGSRNF EVVVPAADRG LIHFYRNNGA PSPGWSGPRP
FAPELGRVDA VSMIQSNFDG HLEVLARVGD RLHMVWRSSG PGASWSVPRR VF