Gene Francci3_1226 details

Gene Information

Locus tag: Francci3_1226
Symbol:
ID: 3902971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1467635
End bp: 1468891
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 72%
IMG OID: 637878559
Product: exodeoxyribonuclease I subunit D
Protein accession: YP_480333
Protein GI: 86739933
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
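
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1467635
end_bp = 1468891
gene_length_bp = 1257
protein_length_aa = 418


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record: 1257 bp and 418 aa.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed Gene Length (1257 bp) and Protein Length (418 aa) are consistent with the coordinates.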


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0802336
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATAACCG GAACAGCCAG AAAGGCGCGC AGTTCGGACC CCCATGACGA TCATCTCGAC 
GGGATTGCCT ACCGTGCTCG CATGCGTGCG CTGCACACGT CCGACTGGCA TCTCGGGCGT
GGCCTGTACG GTCACGACCT GATGCCGGCC CAGGCGGCCT TCGTCGATCA TCTCGTCGAC
GTCGTCCGTT CCGAGGGCGT TGACGTCGTG CTCATCGCCG GCGACGTGCA CGATCGGGCG
ATCCCGCCGG TGGGCGCGCT GGAGCTCTTC GACGAGGCGC TCTCCCGACT GCGCGATGCC
GGCGCCCGGG TCGTGGTGAT CAGTGGCAAC CACGACGCGG CCCGCCGGCT CGGCGACAAG
GCCGGCCTGC TCGACCCCCG CGTCCGCATC CGGACGGATC CGGCCGCGGT CGGGGATCCG
GTCGTCGTCG AGGATCCCGC CGGGGCGGTC CGGGTGTACG CGATCCCCTA CCTGGAGCCG
TCGGCGGCGA ACTCCCAGCT TCCCGAACCG GCGCAGGCGC CATCTGGCTC GGACGTCCCG
GCCGCCGGGA TCCCCGCCGC GACGATGCAC CGGGCGATGC ACGCGGTGCG GGCCGACCTC
GCACGGTATC CGGATGCCCG GTCCGTCGTG GTGGCGCACG CCTGGGTCAC CGGCGGGGCG
GCGAGCGAGA GCGAACGCGA CATCAGCGTC GGCGGGGTGG GCAATGTGCC GGCCCGGTTG
TTCGAGGGGA TCACCTATAC CGCGCTCGGC CATCTGCACC GGCCACAGGC GATCGCCCCG
TCCGTCCGCT ACAGCGGATC GCCACTGGCC TACTCCTTCT CGGAGTCCGG CGACGCGAAA
GCGTCGCTGC TCGTCGAGAT CGGTCCGACC GGGCTGGGGA ACGTGACCCG CATCGGCGTT
CCCGCCCGGC GGCGGATGAC CCTGCTGCGC GGCAGTCTTG CCGACCTCCT CACCGACCCC
GCCCATGCCC CGCACGAGGC CGACTTCGTC TCCGCCGTGC TGACCGACCC CGTCCGCCCC
ATGGACGCCA TGGCCCGGTT GCAGCACCGG TTCCCCTTCG CCCTACGACT CGCGCACGAA
CCGGAGACGG AACCAGACGA GATACTCAGC TTCGGCCGGC GGACCCGGGG ACGCTCGGAG
CTGGAGATCG CCGAGGCCTT CGTCGCCCAT GTACGCAGCG CTCCCTCGGC TCGGGAACGT
GCTCTGCTCG CCGAAGCCCT CGGCGCCGCC CGCCGGGCCG AGGAGGAAGT CGCCTGA
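
The gene sequence is printed in space-separated blocks of ten bases, so it must be cleaned before any calculation. A minimal sketch of how the GC content field could be recomputed from such text follows; the helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as it appears in this record.
first_line = clean_sequence(
    "GTGATAACCG GAACAGCCAG AAAGGCGCGC AGTTCGGACC CCCATGACGA TCATCTCGAC"
)
print(len(first_line))
print(round(gc_percent(first_line), 1))
```

To compare against the GC content field, the same two functions would be applied to the full sequence rather than to one line.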
 
Protein sequence
MITGTARKAR SSDPHDDHLD GIAYRARMRA LHTSDWHLGR GLYGHDLMPA QAAFVDHLVD 
VVRSEGVDVV LIAGDVHDRA IPPVGALELF DEALSRLRDA GARVVVISGN HDAARRLGDK
AGLLDPRVRI RTDPAAVGDP VVVEDPAGAV RVYAIPYLEP SAANSQLPEP AQAPSGSDVP
AAGIPAATMH RAMHAVRADL ARYPDARSVV VAHAWVTGGA ASESERDISV GGVGNVPARL
FEGITYTALG HLHRPQAIAP SVRYSGSPLA YSFSESGDAK ASLLVEIGPT GLGNVTRIGV
PARRRMTLLR GSLADLLTDP AHAPHEADFV SAVLTDPVRP MDAMARLQHR FPFALRLAHE
PETEPDEILS FGRRTRGRSE LEIAEAFVAH VRSAPSARER ALLAEALGAA RRAEEEVA
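
The protein begins with M although the gene begins with GTG, which would normally encode valine. This is expected for an initiation codon under translation table 11. The sketch below illustrates the idea with a hand-built codon table. It assumes that table 11 uses the standard amino acid assignments and that ATG, GTG and TTG are treated as start codons (a simplification; the full table lists further alternative starts). All names are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS; a recognized first codon is read as M, and
    translation stops at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGATAACCG GAACAGCCAG AAAGGCGCGC"))  # MITGTARKAR
```

The output matches the first ten residues of the protein sequence listed above. Passing the complete gene sequence to the same function would give the full translation for comparison.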