Gene Francci3_3349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3349 
Symbol 
ID3905931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3971566 
End bp3972927 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content73% 
IMG OID637880674 
Producthypothetical protein 
Protein accessionYP_482435 
Protein GI86742035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00321833 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.277743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC GTATAGATCA GCAGCGCCGG GCGTCCGCGC CCGGCGAGGT TCCCAACGCC 
CTGGAAGTCG ACGAGATCGA CGCTGCCGCG GCGCAGCGCC GGGCGTCCGC GCCCGGCGAG
GTTCCCAACC TGGTCGCCTA CGTGGTGCCC GCCGAGGGGG AGAGGCAGCG CCGGGCGTCC
GCGCCCGGCG AGGTTCCCAA CACCGGTTCG TTGACGAAAT GCAGCCCGAA TACGTCGCAG
CGCCGGGCGT CCGCGCCCGG TGAGGTTCCC AACTGCACCG TTGACACATC CCCACTATTC
GACACGGTCG AAGCAGCGCC GGGCGTCCGC GCCCGGCGAG GTTCCCAACG TCAACATCGC
CGACGGATGG GCGCCGTCCT CGAACGCAGC GCCGGGCGTC CGCGCCCGGC GAGGTTCCCA
ACGAGGTGTA CACCACCTCA GACACTCTCG ATGTCGTCGC AGCGCCGGGC GTCCGCGCCC
GGCGAGGTTC CCAACTCCGC GAGGAGGTCG GCGAGGGTGG TCAGGCCATC GGCGCAGCGC
CGGGCGTCCG CGCCCGGCGA GGTTCCCAAC GCCATCCTGC TGTACGGCGT CGCGACGAAG
ATCGAGCAGC GCCGGGCGTC CGCGCCCGGC GAGGTTCCCA ACACCAGCTC CTCGGCGCCG
GGGATGTCCG CTGGGAGCAG CGCCGGGCGT CCGCGCCCGG CGAGGTTCCC AACGACAGCT
CCGAGATCGG CCCGCTCGGG GACCGGCCGG GCAGCGCCGG GCGTCCGCGC CCGGCGAGGT
TCCCAACCCG ATCTGCTGGT CGACATCGCC CCCGAGGGGG AGCAGCGCCG GGCGTCCGCG
CCCGGCGAGG TTCCCAACAC CTCGTGGCAG CAGGTCCAGG ACTACGTCGA GCAGCAGCGC
CGGGCGTCCG CGCCCGGCGA GGTTCCCAAC CGCTCGACAG GTCGATCTCG TACAGGGCGC
CGTCCAGCAG CGCCGGGCGT CCGCGCCCGG CGAGGTTCCC AACGATGGCG CACCGGCGCA
GCTCGGCGTT GCGCTCGGCG AGCAGCGCCG GGCGTCCGCG CCCGGCGAGG TTCCCAACTA
CTCCGGCACC GAGACCGCCG CGAACATCAC AGGCAGCGCC GGGCGTCCGC GCCCGGCGAG
GTTCCCAACC CCGTGGTGGC GGCATCTGCC GCGATCGCGG CGGCGGGCCG TCGGGGGTCC
ATTTCCCACC GAGGACGATG CGCAGAAGGC GCTGATGGCC TCGTTAGCGG AGCTTCACGG
CGGCGACTGG CTGGAAGAGG CCTGACTCGA ACCCACGACC GACGGATTAT GAGATCATCG
GCAGCCTGTT CGCCGGTCTC CATGGACGTC CGCCGGTCCT GA
 
Protein sequence
MTSRIDQQRR ASAPGEVPNA LEVDEIDAAA AQRRASAPGE VPNLVAYVVP AEGERQRRAS 
APGEVPNTGS LTKCSPNTSQ RRASAPGEVP NCTVDTSPLF DTVEAAPGVR ARRGSQRQHR
RRMGAVLERS AGRPRPARFP TRCTPPQTLS MSSQRRASAP GEVPNSARRS ARVVRPSAQR
RASAPGEVPN AILLYGVATK IEQRRASAPG EVPNTSSSAP GMSAGSSAGR PRPARFPTTA
PRSARSGTGR AAPGVRARRG SQPDLLVDIA PEGEQRRASA PGEVPNTSWQ QVQDYVEQQR
RASAPGEVPN RSTGRSRTGR RPAAPGVRAR RGSQRWRTGA ARRCARRAAP GVRARRGSQL
LRHRDRREHH RQRRASAPGE VPNPVVAASA AIAAAGRRGS ISHRGRCAEG ADGLVSGASR
RRLAGRGLTR THDRRIMRSS AACSPVSMDV RRS